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ABSTRACT 


Provided herein are methods for prognosing and diagnosing 
fat deposition and related disorders (e.g., obesity and non- 
insulin diabetes dependent meUitus (NIDDM)) in a subject, 
reagents and kits for carrjriog out the methods, methods for 
identifying candidate therapeutics for reducing fat deposi- 
tion and related disorders, and therapeutic methods for 
reducing fat deposition or treating fat deposition related 
disorders in a subject. These embodiments are based in part 
upon an analysis of polymorphic variations of the nucleic 
add set forth in SEQ ID NO:l. 
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1 aggatcagct tctttggtca ggcctaatga tgttaccaac tagtaattta taaaaacttg 
61 catttctaca tttttcttaa ctgaccccat tccaaaacac aagcaaacca tctaaattca 
121 agatttgtga ggttgggact aactcattac tatagggtaa gtatgaatga aggctggaaa 
181 tgtgttttga ttgctgccat tattttttat ttttattttt tctgagatgg agtctcactc 
241 ttgtcgccca ggctggagtg caatggcgtg atcagctcat ggcaacctcc acctccccgg 
3 01 ttcaagcaat tctcctgcct cagcctcctg agtagctgag attacaggca tgcgccacca 
361 cgcccagcta atttttgtat ttttagtaga gaeggggttt ctccattgtt ggtcaggctg 
421 gtctcgaact cccaatgtca ggtgatccac ceaccttggc ctgccaaagt gctgggatta 
481 aaggtgtaag ccacgtgcct ggtgtttttt ggggtttgtt tgtttgtttt gagatggagt 
541 ctgtcaccca ggctggagtg cagtggcaca atttcagctc actgcaacct ccgcctccca 
601 ggttcaagca attcttcctg cctcagcctc ccgagtagct gggattacag gcacctgcca 
661 taaggcccag ctaagttttC tatattttca gtagagatga ggtttcacca tgttggccag 
721 gctggtctcc agctcctgac ctcacatgat catccgcctg cctcggcctc ccaaagtgct 
781 gcagttatag gtgtgagcca ccgtgcccag ccatgttttg attcttaagg aaatgatttt 
841 taagtcccct tccaatgagg atttagcaat ggagttttct tcttctacag ataatttgag 
901 gcttcccaat tgcaccaaac tccatttggg cagatacctc acctgtccac tccatagccc 
961 tagcatcaag aaaagtgcca agcacacaac aggcactcag taaatttctg tcaaatgaat 
1021 gaataaaata ccaccacctt ttcctgagac cattttggac caaggctggc atgggcatct 
1081 ctgttcctcc ctctaactta agaattccat ctcttagagg cagatgctgc cbgttttctg 
1141 caaattttat tattcacctt tctatatttg gaaaaacaag ttggtgtggc ttacttagga 
1201 atctcatatc caggaaaCga gctaaaaatg ttgtatcaaa gattatgcat aggatccaca 
1261 aactaagaag caagcagatg agagcggtta gagacatcgt ggtatatttt ccggaagaaa 
1321 tggtccatgg attaaaaatt gtcattatga tgagtctgca gtaacacaga aaccacgtag 
1381 cgcgaatcag aaaatgaaaa gcacagccat tgactatgat tacaaaccaa aactttatac 
1441 caattacctg ttggaattta tgctgcatga aggcaggata ttttgcttgt cctgttcact 
1501 ggacagtact tggcataaag gatgtgctta tgaacatttt ctgggaaact aaatataaca 
1561 ctcacctgcg aaaacacatc tgggccaggc gcagtggctc acgcctgtaa tcccaacact 
1621 ttgggagccc gaggcgggtg gatcatggga ggtcaggagc tggagatcag cctggccaac 
1681 atggtgaaaa cctgtctcta ataaaaatac aaaaattagc tgggcgtggt ggcgggcact 
1741 tgtaatccca gctactcaag aggctgaggc aggagaattg cttgaacctg ggaggcggag 
1801 gttacagtga gccgagatcg cgccactgca ctccagactg gacaagagtg aaactccgtc 
1861 tcaaaaaaaa aaagaaaaag aaaaagaaaa aacgtatctg aagggaatga acaaatacac 
1921 atgcggaaaa ttgtattcat ggggtagaac ggtggctgtt tccccccctt tcactatttt 
1981 ctctcctaaa gttgtgatag cattttctca gtggaaagga tgaccacaga attatctccc 
2041 ctgggctgca actaaggaag tatttggagt gagaatgcga ccaggcttca aatccaggag 
2101 gggtatttta cagggcgtgt ggctctcacc cagggctcac aggacacctg atatttaggc 
2161 tgttccccaa gaggctaagg acagcgcggc cccagctccc ccaaccacag ctgcccaagt 
2221 ttaggagccc gcactcgtga gcgatggcta ctccgcacag ccccggggtc ctctacggaa 
22B1 atccccggca cctatcagta cccctaaggg aggtcgctcg ttacccctca cttgaactct 
2341 gtcccaaatg ggaggtcact ctctcttcgc ggaccccatc ttccccgaaa gggtctcttg 
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24 01 tcctctctgg ccctcctgag aaggtcatta ctccccaact cgcctttttt gttgtttttt 
2461 ggtttctggt ttattgaggc agggtctcgc ttcatctccc agactggagt gcagtggcgc 
2 521 gatcacagct cacacggcag cctctacctc ctgggctcaa gcgatcctct cacctcagcc 
2581 tccctgggac tacaggcgag cgccactacg cccgcttttt tttttttttt tttttttttt 
2641 ttgtattttt gatagagacg aggtctcgct atgttgtcca ggctggtctc taactcaaga 
2701 ttcctctcac ctcggcctcc gcaagfegctg ggatgacagg tgtgagccac cgcccccggc 
2761 ccctcgcccg ccttttgaag gagcctttcg tcctcaaggg cgaggccact ccccccccgc 

2 821 gagttccatg ccccctagag ggtcatcgtt cccgacgggg aggtggcgcc ctcccccGGG 
2881 CCCCGGGCCC Cgaccgcccg tgctgcctcc ttccgggccc tcctccgcga tgacggcgcc 
2941 gccagcaggc caYgcggact gggcggggct ccgagcgggg actgggaccc agaccgacta 
3001 ggggactsrgg agrcgrggcgrgrc gcggcceitgs cgggctigctig cKccgcgctg gcggccttcc 
3061 tgttcgagta cgacacsrccg cgcatcg-tgrc tcatacgcag ccgcaaagtg gggctcatga 
3121 accgcgccgt gcaactgctc atcctggcct acgtcatcgg gtgagcgtgg ggccgcgcgg 
3181 ggggcgcggc gggtgctgcc ctcgcgtccg cgccgtgcgg cggctcatcc tggcctcggt 
3241 cacctgggcg agtccgggag cggcgagccg aggcggtgac accttccctg ggccccagcc 
3301 gccgcgccgg ggccccgggg gcgggaggct gcttgctgct gttttaaaac cacagcctgg 
3361 gccaggcgcg gtggcgcaag cctgtaatcc gagcactttg ggagcccgag gcgggaccat 
3421 cgcctgagcc caggagttcg aaaccaattt gggcaacatc gtgggacccc gtctctgcaa 
34 81 aaaattaaaa atcatccggg cctggtggcg cgcgcctggg gtcccagcta ctcaggaggc 

3 541 tgaggtggga ggatcgcttg agcccaagag gctgaggctg cagtgagcta tgatcacacc 
3601 acttaactcg agcctgggag acagtccgag acccgtctgt ttacaaaata aataaaggca 
3661 aagcctggct aggctgttac cctctcctgc ctgacccccc atagtgcccc ttggctgtag 
3721 gggcttcgcc ctgaacgtct caatcttcgc tctccacttc ccatcctgct gatttgctga 

3 781 caaaaccaaa gcttggaatt tccgcttggt tccttctggg tgggtggctt ttggtcattt 
3841 cttttgatca ctatgcggtg tcactatgtg gtagtagcga ggtcagactg tagcgagtgt 
3901 ttaaagtttg cttcctttgt tttctgggct tgtggggctt tttgtggtac ctgccctagc 
3961 ctagtcagtc attccccatg ctgccccctb aggctagaga tgccctaccg ccctcaggcc 

4 021 tcgctgaatg tgccattgta cttgaaggca cctgttactt ttttaatttt tttatttctt 
4081 tgagacggaa tttctctctt gtcgcccggg ctggagtgca atggcacgat ctcggctcac 
4141 tgcaacctct gcctcccggg ttcaagcaat tctcctgtct cagtctcctg agtggctggg 
4201 attacaagtg cccgccacca cgcccggcca atttttgtat tttttagtag agacggggtt 
4261 tcaggctggt cctgaactga cctcaggtga tccgcccacc tcagcctccc aaaatgctgg 
4321 gattacaggc gtgagccccc acgcggggct ggcacctgtt acaattgaat acaacgcgag 
4381 agagaagatg ataattacct tgccacctgg ggccacttta aaggcttacc tggggcgttc 
4441 tggtgtctac agctgctggg ctgacacgta ttgcctagtg cagtggtctc caaactgcac 
4501 attggaacca tctggaaatc tttggctccc atccccagac ccttttttat ttttttgatg 
4561 gagtctccct ctgtcgccca ggctggagtg cagtggtgca atcttggctc gctgcaacct 
4621 ccacctccct ggttcaagcg attctcctgc ctcagcctcc tgagtagctg ggattacagg 
4681 cacccaccac catgcccggg ctaatttttg tatttttagt agggacaggg ttccgccatg 
4741 ttggccaggc tggtctcgaa ctcctggcct caggtgatcc acccgccgca gcctcccaga 
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4 8 01 gtgctgggat tacaggcgtg agccaccgcg cccagacgca tccccagacc ttctgattta 
4861 attgctgcag agtatgatcg gctgcccagg tgattctcat gttcagcaac gtttcggaac 
4 921 caatgaactg tgtttcccaa gtctacctga tcatggaaga ctcatctgag gaattgttta 
4 981 aaagtaacag atccccccgc ctttggggtg aggtttggaa tctcaacact gcccaaggga 
5041 agcccaagca ggtttggctt ccaatatctg tcagcagcag cagtagatac agacagtatt 
5101 caagaggatg gcttfcccagg tgcagaccca acatcagaac cacctggccc acacccagat 
5161 ttggactcat tggctcacac ctgtagtccc agtgctttgg gaagccaagg tgggaggatt 
5221 gcttgaggcc aggagctcaa gaccagcctg gacacaagac cagcctggac aacatagcga 
52 Bl gaccccatct ctaagagaaa aataaagttt aaaatggtaa attgtactga tggagcagtg 
5341 caaaaaaaag aatttttttt tttttttttt ctgaacacgg tggctcatgc ctataatctc 
54 01 agcactttgg gaggccgagg cgggtggatc acctgaggtc aggagttcga gaccagcctg 
5461 gccaacatgg taaaaccccg tctctactaa aagtacaaaa aattagccag gcctagtggc 
5521 taatgcctgt aatcccggct acatgggagg ctgaggcagg agaatcgttt gaatccagga 
5581 ggcggaggtt gcagtgagcc cagattgcgc cactgcactc cagcctgggc aacaagagca 
5641 aaactccatc tcaaatatat atatatatat atatatatat atatatattt tttttttttt 
5701 tttggagact cactgctagc cagttaactg aactcacatc cctggttgga cctgaagcat 
5761 ttgggggcag gagctcccct gcagcttttt gcttttggcg tttgctgcta gtgggaggat 
5821 cagtgcacat ctaccttagg gtacctgtaa aattacaact gcggaaaata tccatagtgg 
5881 tgttgcactg tatggtagcc actagtaact tgcacctgtt gaacacttaa ccgttcctac 
5941 tctacactga gatgtgctat aaccttaaaa tacacactgg atttcaaaga catagtacag 
6001 aaaaagaggc tgggcacagt ggctcatgcc tgaaatcaca gtactttggg aggctgaggt 
6061 gggaagatcg cttgagctca agagcttgag acaaacctgg gcaacatagc gagaccttgc 
6121 ctctacaaaa aaatacaaaa attagcctgg catggtggca cacgcctata ggcctaccta 
6181 tttgggaggt tgaggcagaa ggactgctCg aacctgggag gttgaggcca cagagagctg 
6241 tgatcacacc a.ccctccagc ctgggcaaca gagcaagaca ctgtctcaaa aaaacaaaga 
6301 gtaccactat cacatgaaaa cattttttat tgatgacatg ttaaaatatt ttggatatat 
6361 tgggttacac agaatatatc attacaatta atttttcctg tctctttttt ctttttttct 
6421 ttttaaacgg cgccttgctt tgtcacccag actggagtgc aatgcactgc aacctctgcc 
6481 Ccccaggccc aagtgattct accatctcag cctctggaat tatgggacta caggtgcaca 
6541 ccaccatgac tggctaatta ttgtgttttt gtttgtttgt ttgtacagat gagaaatctc 
6601 atctctacaa tcatgaggtt tccatgttgc ccaggctggt ctcgaactcc tgagctctag 
6661 caatctgtct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgtgccca 
6721 gccctttcta tttcttttta ctttttgaca tggctactag aaacttttaa attacagggt 
6781 cttgccctgt ggtagctcac gcctgtaatc ccaatacttt gggaggctga ggcaagtgga 
6841 tcccttgagc ttaggaattt tgagaccagc ctgggcaacg tagtgaaacc ccatctctac 
6901 caaaaatacg aaaaattagc ttggcatagt ggcgcacacc tgtggaccca gctacttggg 
6961 aggctgaggt gggaggattg ctgtagcctg ggaggcggag gttgcagtga gccaagatcg 
7021 tgccactgca cttcagcctg ggcgacagag cgagagcctg tctcaaaaaa aaaaatatgt 
7081 ggctcgcgtt atgtttttat gggacagcgc agctctgtag gattccttat gacgcattca 
7141 ctatctcggt catgagaatc tttaaagcag taaatgcctt gcttttgtcc gtcttgctat 
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7201 aatcaaaact gactcaggac ctgcattaag gctctttttt tttttaacct catctcttaa 
7261 ctttttgatg caactcatct tagtgaaatc gtggtctcct cctacagcaa ccctccctca 
73 21 aaaaaagacc gaacacRcaa ttctcatcta tctaactctg ttaacaatca tatgaggagg 
7381 aaataatcac atgataatga gtggcatgaa accctgtgtc tctaaattaa gcaactcact 
7441 gtacttgatg agtacatcag tcttcctgcc ccaagctttt gtgtgtcaga ctggaattaa 
7501 atggaagctt acactgcagc tgaattccta gaagccctgg acttgtcagg ggaaaagccc 
7561 atggggggaa agcccatctc taggctgggc aatgaacttg gagactgctc gatcacacac 
7621 actcataagt aacaaaacca tcaatttctt ttcttttttt tggtggaggg acagagtctc 
7681 actctcgccc aggctggagt gcagtggcat gatctcagcc cactgcaatc tccacctccc 
7741 gggttcaagt gattctcctg cctcagcctc ccgagtatgt gggactacag gcacccacca 
7801 ccacacccag ctaattttgt gtatttttag tagagtcggg gtttcacaat gttggccagg 
7861 ctggtcttgg actcctgacc tcaacaaaac cagtgaattt ctgaagtctc agqatctagg 
7 921 acttagaaga aggagaagcc tagcatttcc tgattccata aataacaaca ctgatagtct 
7981 ctagtagatg tttaacagca gttcgaaggg gaggggttct gatttgtagt actggccaat 
8041 ttccacggtg tcagtacttg ctctgtgaGC aacgatgact cacaaacatt gttgtaaaca 
8101 tttgacctac attaattctt ttaccctcac aagaatcctg tgaagaccta ctatcattcc 
8161 cagattatag atgggaaaaa ctgaggcaca atgaggttaa gcaccttggc caaagttatg 
82 21 cctcgtgaat ggtggagctg agccatgaac caagacctct gcctccagtc tgtcctaacc 
8281 cctataagct aatgcctgta gatggttttt tttttttttt tttttttgag acagagcctt 
8341 gccctgtcgc ccaggccgga gtgcggtggc atcacagctc actgcagcct ccgtctccca 
8401 ggctcaagtg atcctcccac ctcagcctcc caaatagccg ggatttttat tggacagcac 
8461 agctctgtag gattccttat gacgtattca ctgtctcagC cacgagaatc tttaaagcag 
8521 taaatgcttt gcttttgtcc atcttgctat aatcaaaact gaccatgccc ggctaatttt 
8581 ttattttttg tagagatccc tatgttgccc aggcttgtct cgagctcttg ggcacaaatg 
8641 atcbtccccc cttgacctcc caaagttctg ggattacagg cgtgagccac tgtacccagc 
8701 ccctttggat ttttaattgt tcaaactggt agtcatagtc ccacttcaga ggagagaacc 
8761 atatatgtga ccagaattct tgggggaaaa gtaagaaact actcttaaaa gttccttaaa 
8821 gggtagaatt ctcttttcta gaaagagtag cctagggttg tccaatcttt tggctttcgt 
8881 gggccacact ggaaggagaa attatcttgg gccacacatt aaatatgcta acactaacaa 
8941 tagctgatga gctaaaaaac aaactttttt tgatcacaca aaaaaaaatc tcataacgtt 
9001 ttaagaaagt ttacgaattt gtgttgggcc tcatccaaag ccatcctggg ccgcatgcgg 
9061 cctgtgtgct atgggttaga caagcttaga gtagagcata gtgggtgaga gaagaggctt 
9121 cagatgcttc tggcttgagt tcaaatcctg actctgccac tcactagctg tgtgaecttt 
9181 ggcaagtgac ttaacctctc tgtgtttcca tttcctcctc tgtaaaatgg aaataataat 
9241 aatgatacct gtctcataaa gtggtcagga ttaaatgagt taatacatat aaagagctta 
9301 gatcagtgcc tggtgcctat aaacactcca gtggtaactg ttataatttg tattatctct 
9361 tggtgattcc cataggtcac aggcattgaa agtgctcaaa gggccaggtg cagtggctca 
94 21 cgcctgtaat cccagcactt tgggaggtcg aggtgggtgg atcacctgag gtcaggagtt 
94 81 cgagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaattagcca 
9541 ggcgtgttgg cgcatgcctg taatcccagc tacttgggaa gctgaggcag gagaattgcc 
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9601 tgaacccggg aggcggaggt tgcagtgagc caagatcaca ccattgcact ccagcttagg 
96 61 caacaagagc gaaacatcgt ctcaaaaaaa aaaaaagaga aaatgctcaa atgtaataaa 
9721 gacagtattc atcgagcgtt ttctctgtgc catcagtgtg ctgagtgctt tacctgcacc 
9781 atctcattta tttcccgcaa accctcctgt ggtttaggtg ctaccatccc catttggcag 
9841 aagtggaaat ggagtcccct agaaggtacg tagcttgtag aaagtcatgc aattagtcaa 
9901 catcgtactt ccagccactg gccagcagaa gcttccccat caattcaaat ccattgaatc 
9961 ctaatggcaa cctaagaggc ctgttttatt ttatagsrtgrgr grtgrtttgtgt ggrsraaaaggg 
10021 ctaccag^aa actgactccg tgcftcagctc cgzcacgacc aaggticaagg gcgtggc€gt 
10081 gaccaacACt tctaaacttg gattccggat: ctgggatgcg gcgg^ttatg tgataccagc 
10141 tcaggtgtgt ctcccactgt gtcttctgtc taacactgac acctcgctct ttactgactt 
10201 tgcacacttg attaatgatt gacgtggcct gagtcctgac atcaccaatg cccagaatga 
10261 ttgttttagt ccaagaaaac ggtgcagcca gatctctttt cagcttagga tctaacagag 
10321 tgtatccagg cagctgtggt tcctcagctt gaaagcatgg caggctgaaa gccctgaaag 
10381 atgagaaaac aaatgtctta aatcctgata gaaattcaag tggaaatatc acctttccac 
10441 attttgggtt tgcattattg tcccaaatac ^cgaggtaaa tcttttcaga cagtgtaaaa 
10501 actttaggcc gggcacagta gctcacgtgt gtaatcccag cactt.tggga ggctggggcg 
10561 ggcagatacc ttgagcccag gagtttgaga ccagcctggg caatgtggtg aaaccccatc 
10621 tctataaaaa aaaaaaaaaa gaaaaaaaaa gaaaagaatt agagaccatt agagaccaaa 
10681 acccaaacaa gaaggaaaca gagacagcag cacaggactc catcgggagc gaaaggaagg 
10741 gaagacattg tttactgctg catactagtc ttcaggctgt taatctgcaa agctccaagt 
108 01 taaacattga ctctttagtc ctggtaacag ctctgcacca cacagtcaaa cccgacatag 
10861 ccctccagat ggcctatgtg ctaagggacc gagggcgacc agtagagata agctgccaac 
10921 tchtggctca gagctcttgc cagggaaaac aagaactcca gaaactcttt cttcaggtgC - 
10981 gttcttcccc ctgccttctc aagagaaggg tcttgtgggg tccccagagw ggagaccgag 
11041 ctaatgagcc ataaaagtca ggaggaaggg ccaggtgcgg tggctcatgc ctgtaatccc 
11101 agcactttgg gaggccgagg teggtgtatc atgaggtcaa gaga tt gaga ccatcctggc 
11161 taacatggtg aaaccctgtc tactaaaaat acaaaaatta gtggggtgtg gtggtgggtg 
112 21 cctgtagtcc cagctactag ggaggctgag gcaggagaat ctctcgaacc tgggaggcgg 
11281 aggttgtagt gagccgagat agtgccactg cactccagcc tggtgacaga gcaagactcc 
11341 gtctcaaaaa aaaagatcaa gtggaacagg ggatttgcag ttggctggag ggggeagtcc 
114 01 tgttagcatg tacctgtctc agcagagaga cggttgcctc ctgtgcaggc atagccttgt 
11461 cccggtgagc tctgggaatg tcatgctcgg cgtgtttact gggcagatga ggaagggcac 
11521 aagccccgct tttgtccctt ggtggaaagg agcaccccac aactcagcct tccgtggtgg 
11581 atctgggcgg gggccgtgtg agttccccag ggccttcctc atgagctttc acgacgtctg 
11641 tgttctttgt agatggaaca cagtcatcta tttgcagccc cttggctcat ctccagactg 
11701 aaattaccct taatttgaag tagataagtg ataggaaaca ggccagatgt ggtggttcat 
11761 gcctgtaatc tcaacacttC tggaggccga ggcagaagga tcacttgagc ccaggagttt 
11821 gaggctgaag tgaactgtg^i tcacgccact gcactccagc tagggcaaca gagcaagacc 
11881 ccaactctaa aagagagaga aagaaggaga gataggaaac gaaaagtaga ccatactgga 
11941 ttgttccaag tataactcag gagagtcgta ggagcggcct tctctaacac caaatgcaga 
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12001 gtgtgtttgt acattgaaca cctgggaacg tgacaggtgg cgggagagag gctgcagctc 
120 51 atgccacatc ctcacgcctc ctcctgtggc cttctccctg tctcccaaca tagtttgccc 
12121 taggcctttg gaggacgcca gctatttcta gagcttccac cacttgtcag accgtggggt 
12181 tggggatgtc tacagactgg ttggtttctg agttgccctg atccccaaga tggctcccac 
12241 tactgactgt tatcctcagg agtgtgaaat gctaaaagcb ttgggcatct ccttttaaaa 
123 01 tgcctgggcc cagccaggcg cagtggctta cgcctgtagt cccagcactt ttggaggcca 

123 61 aggcaggagg atcacttgag ctcaggattc aagaccagcc tgggcaatat agtgagaccc 

124 21 catcttgatt ggatgggtgg tggatggatg gatggatgga tggatggatg gatggatgga 
124 Bl tagatagata gatagataga tagatagata gatagataga tggaaagaaa gaaagaatag 
12541 acaaatgata ggataggaga aagaaaggga aagaaagaaa agagaagaga aaagaaagaa 
12601 aagggatagg atagatgaca gatgatagga tagattagat aggtgataga taatcgataa 
12661 atagaaaaca gatgatagat ataggataga ttagatagat agatagatag atagatagat 
12721 agatagacag ataatagata gattagatta gatggatgga caccagcccc caatgagagg 
12781 aggatccaac tagaatcttt tggggtcttt gccttttatt ccttatatga ggaggtcatt 
128.4.1. at.cag.caatt....ct.tca3tctc . atccctagca. aattgctttc cagaaac.tct. aaatcaatta 
12901 tttggcacct ttatacatat aaggaggaaa cagaacctcc tcaaagagaa atgtcacttg 
12961 ctgaccctga ccgagtcttc tctagaatgt ttgatttata atttcagtta ggaaagaaaa 
13021 ccagccagat ttgagcccca tgaggcctca tgtttttact cctctgagat aatctggttg 
13081 gcagggaatg tttttagcac ttgttggctc tttccctcag taagtcaagt ttgattgaga 
13141 aaccattgtt tcagtttgca gacccccccc tttttttaac tgcagtcaaa ttcacttata 
13201 gaaactaacc attttaagtc gggcacagtg gctcatgctt gtaatcccag cactttggga 
13261 ggctgaggca ggtagatcac ttgaggccag gagttggaga ccagcctagc caacatggtg 
13321 aaatcctatc tctactaaag atacaaaaat tagccaggtg tggtgatgta cacctgtagt 
13381 cccagctact caggagggtg aggctggaga atcgcttgaa cccaggaggc ggagattgca 
13441 gtaagccaag atcgtgccac tgccctgcag cctgggcgac agagcaaggc tctgtctcaa 
13501 ataaataaat aaataagaaa gaaattaacc attttaaaat gaaaaattca gcacatttag 
13561 tgcattcaca gtgtCatgca agcatcacct ctatctagtt caaaacatt-t ccatcacccc 
13 621 aaagggaagc ctcgtatgtt acacaaccac tccccattcc ccacttctct cactccctgg 
13 681 caacccgtaa gtgctttgtc tctatggact tgactgttct ggaaatttca catcaatgta 
13741 atggtatcct gtgtggcctt tgtgtctggc ttctttcact cagcatcatg tttttggggt 
13801 tcatccatgt tgtagcctgt atcagtgctt tattccttta ttttattttt tgagatgaaa 
13 861 tctcactctg tcacccaggc tggagtgcac tggcgtgatc tcggctcacc gcaacctcca 
13921 cctcccaggt tccagtgatt ctcctggctc agccccctga gtagctggga ttacaggtgc 
13981 ccgccaccac acccggctcc ttttgtgtgt gcgtgtgtct gtgtgtgttt gtgtgtgtgt 
14041 gtgtgtgtgt gtgtgtgtgt ttttagtaca aagtttcacc atgttggcca ggctaatctg 
14101 gaattcttga cctcaggtga tcctcctgcc tcagcctccc aaagtgctgg aattacaggt 
14161 gtgagccacc atgcctggac ccttcattcc tttttattca tggctgaatc acattccatc 
14221 atgtgactag actaggacag acatttgggc tgtttccacc attttttttt tttttttgag 
14281 acagagtctc gtgtcatcca ggctggagtg cagtggcgcg atgtcggctc gctgcaacct 
14341 ccgactcccg ggttcaagca attctcctgc ctcagcctcc caagtagctg ggactacagg 
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144 01 tgcccgccat cacacccggc taatttttgt atttttagta gagacggggg tttcactgtg 
14 4 61 ttgaccaggc tggtcttgaa ctcctgacct cgtgatccgc ccgcctcagc ctcccaaagt 
14521 gctgggatta caggcttgag ccaccgcgcc cgacctgttt ccaccttttg gctcttgtga 
14581 aaagtgttgc tctgaatatt tatgtacaag gacttgtttg agtcactgtt ttccattggg 
14641 ttataaactt tgatgggaga gagcgggaga gcacgcggtc agtgtttgag ttgtcctctt 
147 01 cagtcagcgt ctgcctttct tggctctccg tgagtcctct gagYgtggct tgcecgtgct 
14761 gtctcccctc tgcaggagga aaactccctc ttcgtcatgra ccaacgtgat cctcaccatg 
14S21 aaccaffSLcac agggcctgtg ccccrgaggta ggaggccccc gggaagagcc ccaggcccca 
14881 cacccctctc cacgcctgtc cacctgtgtg tggggccggg ccacgtggac tttcttttcg 
14941 ctccttcttt ttccagatte csLgatgcgac c^ccgcgtgt aaatcagatg ccagctgtac 
15001 tgccggctct gccggcsccc acagcaacgg tacgagctcg tggcctcctg gggagggcRg 
15061 cccctgagca gatcgccccc actgtggagc gtctctgata gagaaatctt cccaatbcct 
15121 tcacatgacc ctgggtgagc caggtgccga ggctggggtc cCggagcccc tctacattca 
15181 ctgctgtcat tggagcccca caagccatcc cagctcttgc cctactgttt ttttttttgt 
1S241 tgttgttgtt gttttttctt gttttttttt ttgttttgtt tttttgtttt tttgagccag 
15301 agtctcgctc tgtcacccaa actggagtgc aatggcatga tctcggctca ctgcaacctc 
15361 cacctcccag tttcaaacaa ttctcctgcc tcagcctccc gagtagctgg gattacaggc 
15421 atgcaccacc acgcccagct aatttttgta cttttggtag agatgggttt taccatgttg 
15481 gccaggctgg tctcgaactc ctgacctcag gtgatccacc cgcctcagcc tcccaaagtg 
15541 ctaggattac aggcatgacc cactgtgcct ggcctcttgt cccattcttt agcttggcat 
15601 caccqtggct gagatgtggc tggcacacag gtaactgtct tcctccgatt ctaactcctg 
15661 gacagtgaca ctgtctaaat cccagatttg cacagctcaa gactggctgc atgggaggct 
15721 grgacggggct ctcactccct actccaaaaa ggtagaaaat aggagacccc tgtggacatg 
15781 ggacccccct gccacccttg cgcttgtagg^ agtictcaaca ggcagghgcg tagctttcaa 
15841 cgggtcYgtc aagacgtgtg aggtggcggc ctggtgccag gtggaggatg acacacacgt 
15901 gccacagtga gtccagccct agggaaggaa gtgccttttt gttttgtttt gttttagaca 
15961 cagtttcact c,tgtatcgcc aggttggagt gcagtggtgc katcttggct cacsacamcc 
16021 tccacctc,cra gtgttcaagt gattttctgt gccwcagcct cccgagtagc tgagattata 
16081 ggcacccacc accacgccca actaattttt tgtattttta gtagagacgg ggttttgcca 
16141 tgttggccac gctggtctcc aactcctgac cccaagtgat ccacctgcct tggcctccca 
16201 aagtgctagg attacaggcg ggagccacca cacccagcca gaagccacca cacccagccg 
16261 gaagcacctt caactctgcg tgtcagcctt gaacagaggt ggtcaaggag ttacacctgt 
16321 gtgttgtggc tttttgtgtg tttctggtgt ctgtatgaag tcaggctgcc ctaggatggc 
163 81 tgctctcaac cacaaggcag cgaccatcag tggcaagatg ggccacaagt agtctgtcac 
16441 tcacgagaag ggtgcaggtg actgacttcc ttgtatgtta ggactctgtc agttgcaaat 
16501 aactgacagg aaacagttct cagtgcctta agcacatgtg cacaaacaca tacacacata 
16561 cacatgcgtg cacagcgggg gggcagtgta ttgtccggct gaccaaaaag atccagcgtg 
1S621 gccaggagaa tctcgtgcca ggctgaggca gaagaatctc ttgaacctgg gaggaggagg 
16681 ttgctgagct gagatggtgc cactgcactc cctcctgggc gacagaggcc actgcactcc 
16741 tgcctgggcg acagagcgag agacaaagga caacacccta gtctcccttc ctggctctgc 
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168 01 atcccgccgt gccagcttca ctcgcaggac ccacagcata gcccgggagg cagcgctgca 
16861 cttacagccc cccaagtctg agtctccctt tccagctgtc cctggagatt ctagacctgt 
16921 ccatcatcgt atcaatggcc atatctctca acccatctct gtggctgggg aatgggtttt 
16981 gcttatcagc ttgggcctgg gtcacatgcc cacctctgca gctggggcta gagtcagacc 
17041 cacccaagtg acataggcaa gcattgggga gatatggctt ccaaaggaca tttggggttc 
17101 bgttaccaga aaagckggga gcagatgctg ggaagaaaac aacggatgtt tgctacagtg 
17161 tattttaaac taaactagac aggttcatcc tggcgcggtg gctcatgctg taatcctaac 
17221 accttgggag gctgaggcag tcggatcatc tgaggccagg agttcaagac cagcctggcc 
172 81 aacatggaga aaccccgtct ccactaaaaa tacaaaaatt agggttaggg tggcgctTCgt 
17341 ctgtaatccc agctacttgg gaggctgagg taggagaatt gcttgaaccc gggaagtgga 
17401 ggttgcagtg agctgagatc gctcttctgc actctagctt gggcgacaga gagagactct 
17461 gtctcaaaaa ataaaaattt aaaaaaataa ataaataaaa atttaaaaat caactaaact 
17521 agccaggctc aaggttgctc tagaataaca ttcttcgtgg ttgtaattgg ctgtgcttca 
17581 tggggtggag tccgggcagc gctctgagaa gtgcgtgcag cctagggcct ttctcaccga 
17641 gagctcagac cgaggtctcc atgagccccg gggagcagga agccactgaa cctcatcctg 
17701 ggtgtgttga gacagaagga aaggctgaga actgcagatt tagagattgc taaggcaaag 
177 61 gaaggagata gaaaaagaaa aggaatccta gcatcttaga gctggagagg atcfccacaga 
17821 gctgtgctct ccaacacagg agactctggc cacgcgtggc Cattgaaaat tgaaatatgg 
178.81 ctgggtccga aaggagatgg gctgtccata ggaagtgcac ttatcatcaa attggat:ttt 
17941 tttttttttt ttttgagacg aagtctcgct ctgtcgccca ggctggagta cagtggcgtg 
18 001 atctcggctc accacaacct ctgcctcccg ggttcaagtg attctcctgc ctcagcctcc 
18061 tgagtagctg ggattatagg caccccccac cacaccttgc taatttttgt atttttagta 
18121 gagatgcagg tttcaccatg ttggccaggc tggtctcgaa ctcctgacct caggttatcc 
18181 acctgcctca gcctcccaaa gtgctgggat tataggtgtg agccactgca cccagcccga 
18241 ttctgtttat tatttattta ttttttattt atttgtattt atttatttat ttgagatgga 

183 01 gtctcgctct gtcgcccagg ctggagtgca gtggcgccat ctctgttcac tgcaagctcc 
183 Gl gcctcetgcg ctcacgccat tctcctgcct cagcctcccc agtagctggg actacaggcg 
18421 cccaccacca cacccagcta atttttttgt atttttagta gagagggggt ttcactgtgt 

184 81 tagccaggat ggtctcgatc tcctgacctc gtgatccacc tgccttggcc tcccaaagtg 
18 541 ctgggattgc acgcgtgagt caccacgccc ggccctgaat atttattatg aaaaaaagaa 
18601 tgtataatat ctcattgatt tttgtatgtt ggttacatgt tgaagtaata ttttgggtta 
18661 ctgtataaaa ttttacttgt ttcttttact ttgtagtgtg gctattagga aactttaagt 
18721 tggagtgtgg cttgtagttg tgtctcataa gatacttttt ttttgacaca ggatcttgct 
18781 ctgtcaccca gtgcagtgat aacgatcatg gctcactgca gcctcgaact cctggactca 
18841 agtgatcttc ccgcctcatc ctcctgtgta gctgggacca caggggcgca ccaccatgcc 
18901 cagccaactt attttttgta gagacagggt ct caeca tgt tgcccaggct ggtcttgaac 
18961 tcctgggctc aagcaaccca cctgccttag cctcccaaag tgctgggatt acagcgtgag 
19021 ccaccacacc tggcctcaca atatactttt attggacatc actgttgcag atctaatgcg 
19081 gtgtaaaatg atttttaagt gcacactgat ataagaaata acattggcca ggcacagtgg 
19141 ctcatgcctr taatccyarc actttgggag gcwgagrcrg kyggatcayt tgaggccagg 

FIGURE IH 


Patent Application Publication Jan. 29, 2004 Sheet 9 of 51 US 2004/0018533 Al 


19201 agttcaaawy cagcctggcc aacatggtga aaccccctct ctactaaaaa tataaaaatt 
192 61 agccaggcgt ggtggtgcgc acctgtaatc ccactactca ggaggctgag gcacgagaat 
19321 tgcttgaacc cgggagatag aagttgcagt gagccaagat catgccactg cgctccagcc 
19381 cgggtgacag aataagaccc tgtctcaaaa acagaacaaa gactaaaacg ttttggaact 
19441 agacagaggC gatgattgca caacattatg aatgtact-aa atgccacaga attgtccagt 
19501 ttaaaatggt taattttatg ttatgtgaat ttcatctcaa tttctctaaa aaggtgagtt 
19561 gatcttaaag tattacatgt gaggtacaca ggtgtgtgtg tgggtgttac ggtacaggaa 
19621 atgacaaaaa cttga^cacc cctgttgtaa gccatccctc ttgagggagt ggggactttg 
19681 aagacctgag agaagtccag gcacagtggc ccatgcctgt aatcccagca ctttgggaga 
19741 ccgaggcggg tggatcaCcc gaggtcagga gtttgagacc agcctgacca acatggtgaa 
19801 accccatctc tactaaaaat atgaaaaaat tagccaggcg tggtggtgca cgcctgtaac 
19S61 cccagctact cgggaggctg aggcaggaga atcacttgaa cccagaggtt gcagtgagcc 
X9921 gagatygcgg cattgcactc cagcc tgagc aacaggagca agactccatc tcaaaaaaaa 
19981 aaaaaacctg agagaagccc cttggtccca gcctttctct gacagcagtg gtgacaggct 
20041 cagctctcct ccgagtgcag ccctgtcact grccttgctc ctgtctcagg gctgggaaga 
20101 ctgttgatgt tgtcattcca aagatcccac ctggatcagg ggaacatccc ccacagaagg 
20161 gttagccata cagtgccaga ttctccagga gaaattcacc aaagaaatgg agtccccctg 
2 0221 gggacagatt caacttgtat tgtcagccag gagctgacgt ggcacttctg agaagaggcc 
202S1 ggcgcacctg ctggcgggtg ctttgtgcac ttttcagaca ggtcaggatc cagcctgtag 
20341 gcaaatttac ttttgctttg acctgtaaaa ccggatctgc ccagccttca ttctytccct 
20401 ggagaacgcc tgcggcccca aagccaggcc tactgatttc cagtgaggcc acaaatcccc 
20461 tccctggtta gcaattcagt tttcaccgcc tcgggggaga gggcctgccc ttccttgagg 
2 0521 ggaagaacct ggaaggctcg gctcagtgtc ttctcttgga agagaagagt gtgcatgcag 
2 0581 aagggtgtag aaaatgctag acgtgtttca Ccttcttgac aaaatgacat tgtaagatgt 
20641 gtgtatatgt ttttaaaaat attacatagg ggctgaacat ggtggcccac tcctgtagtc 
20701 cagcacaggg aggcggaggt aggaagattg cttgaggtca ggagttcaag accagcctgg 
20761 gcaatgtagg aagacttcat gtctacaaaa aaaaaaaaaa aaaatttcat tagccaggca 
20821 cggtgatgca tgcctatagt tccagctact tgagaggctg aggtgggaga accccctgag 
2 0881 cctgggaggt cgaggctaca agaagctgtg ttcrtgcaac tgtactccag cctgggcaac 
20941 agagcaagac cc tgtctcaa aaatatatat aggctgggcg tgatggctta tgcctgtaat 
21001 cGcagcactt tgggaggctg aagcaggcgg atcacttgag gccaggagtt ggagaccagc 
21061 ctgaccaaca tggyggaacc ctgtctctac taaaaataca aaaaaagtta accaggcata 
21121 gtggctcaca cctgtgtaat cccagctaca cgggaggctg aggcggcagt gagctgagat 
21181 cgcgccaCtg cactcaagcc tgcactcgag gtgacagagt gactccatc't caaaaaaaaa 
21241 aaaaaaaawa tafcatatata tatatayaya cacacacaca casacasaca cacacaayac 
21301 aatacatata tatatgtgtg trtatatata tatataacat ggttatgtga gtatttgtat 
21361 gtattttatt aacagtaatg ttatcattgt ttttcagacc tgctttttta aaggctgcag 
21421 aaaaattcac tcttttggtt aagaacaaca tctggrtatcc caaa tttaat ttcagrcaagt 
21481 aagtggtggc caggtgtgtg agttcaccag ggtcttggag aaacttctgg ctcttctctc 
21541 ttccccgagg ttttcgtcgc tctgattttc tgcttcctct cgactttagg aggaatatcc 
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21601 ttcccaacat caccactact tacctcaagt cgtgcattta tgatgctaaa acagatccct 
21661 tctgccccat attccgtctt ggcaaaatag tggagaacgc aggacacRgt ttccaggaca 
21721 tggccgtgga ggtgggtgcg ggccctggct: ctcctgaccc agccctggag gcgtctcgtg 
21*781 ccaggtgctg aggaaagcct tgccgtgtct ctgctgctca tccccaggga ggcatcatgg 
21841 gcatccaggt caactgggac tgcaacctgg acagagccgc ctccctctgc ttgcccaggt 
21901 actccttccg ccgcctcgat acacgggacg ttgagcacaa cgtatctcct ggctacaatt 
21961 tcaggtgggc gtgagcttgg gcccctcgct catgttgtag ggggtgctgg tggctgcgta 
22021 cgtgccagtg ggccgcccac tgaagaccag cacccaggca gcaccccaag ggcaggctgc 
22081 cggtcccccg tccaaggcgg cgggaaggcc atgctgggaa aatgccctta gtggtgtcct 
2 2141 gctccggggc catcccggcc cccgagaccc ctccttgccc tttcctgccg caagaaacat 
2 2201 gttgagatgg ttcttagagc agccaggaga agctgggggc ttaagctttc cagcacctgc 
222S1 ctcagccatg acctccattc actgcctcaa ggagcggatg atcttgtgat cctccagtcc 
22321 aaaggcctct ggggcctggc ccaggaattg gtttctcaaa ggttgaacct gtgccagaat 
22381 ctccagagtg caggggacac aggtctgagg cccctgaaag gccttgctgg gttccctgct 
22441 gcaaatgctg gcatctcggt gtcctggggc agatgtaagc gagtcccaca aacagtggct 
22501 tgcaactaca ggactfcgatt ctctcacagt tctcacagtt ctagaggcca caacctaaaa 
22 561 tccaggcatc agcagggctg tgctcgctct gaaggctctg gagagagtcc ttacttgLct 
22 621 ctcctggctt ctggtagccc acggtgttcc ttggcttgtg gatgcatcac tccagtctct 
22681 gcctctcctg tcatgaggcc ttctgccttg taYgtgtctc tgtgtctttt cctcttgtaa 
22741 ggatatcagc cactgaattt agggcccacc ctactctagt atgacctcat cttaacaaat 
22801 tacgtctgcc aatatcccat ttccaaataa ggtcacactc ccaagcttca ggtagatgtg 
228 61 aatttttggg gacacactgt tcaacccact acagctgatg atctcatgat cagaatctat 
22921 agccaggtaa atttccacgt accctttgtt gcatttttca cagtccttaa aaaggaagcg 
22981 ctccaggccg ggcgcagtga ctcacacctg taatcctagc actttgggag gccaaggcgg 
23041 gcagatcacc tgaggccagc atggcgaaac cccatctcta ctaaaaaaaa tacaaaaatt 
23101 agctgggcat ggyggcgcat gcctgtaatc ccagcaaett gggaggctga ggcaggagaa 
23161 tcgcttgagc cagggaggca gaggttgcag tgagccgaga tcgcactgct geactccagc 
23221 ctgggcgaca aagcaagtct ccgtctcagg gggaaaaaaa agaggccctg caaggaccat 
23281 ccctgggcgc ggtcagcggt gctttgggct gcacttgcct ccccctagtg gtgacgtgtg 
23341 gaatccgatc catacgggat cctaagacct ctgtaattgt gggatctgct gtccttttga 
234 01 accttctggc tcactgagta aagcagatat ttgaattttt cttttctagc agttagattt 
234 61 gtgttattcc atctctactg tcacacactt cttgtggaaa cctcaagctt aggtcagagc 
23521 cctaaaacgg aaagamctca ggacgcctat cccccaacct cgggcagcac caggccctgc 
23581 ctgtgtctcc caggcctggc ccatggcgtg gaagttgtgg gccagataga gcatggagct 
23641 agggtgcggg ccgaggctcc tgatggatcc ctggagaaca tggggcagca gatggtatgt 
23701 cttgacggtt cttcaccaac catccatccc ctgctcatgc gtgtgygctc tctcgcgtgc 
23 761 tcgctctctc tctctctcac atatgtatat gtatatatgt gtatatatat atatatatag 
23821 ccattatata tatagccatt atatatatat atattttttt ttyttcttct tttttaaggg 
23 381 agtgacttgc tctgtcaccc aggttggagt gcagtggtgc gatbtcagct cactgcagcc 
23941 tttgcctccc gggttcaaga cattctcctg cctcagccgc ctgagtagct gggattacag 
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24 001 gcgcccgcca ccacacccag ctaatttttt gtatttttat tagagacggg gtttctccgc 
24061 attggccagg gtggtctcga gctcctcacc tcaagtgatc cacctgcctc ggcctcccaa 
24121 agtgccggaa ttacaggtgt gagccaccgc acccagccat atatttttaa atggtcttac 
24181 tgagatgcaa ttcacatgcc ataccgttca carcatttgaa gtgcagaatt catagttttc 
24241 actatatcca cagataagtg caggcttcac cacagtcgat tttagaacat ttccgtcatc 
243 01 tcaaaaagaa accctgcacc cttctgtcat cccccggccc ctggcaacag tggtctactt 
24361 Cctgtctccg cagatgtgcc tcttctgagc atttcatata atcaatatgt ggccttttgc 
24421 gtctggctcc ttccccttag cctgatgttt tcaagcttca tccatgtcgt agcctgtgtc 
24481 agtacctcgt ctctttttat ggccaaatca tattctgctg tgttgacaga cgacgttttg 
24541 ttcatctctt tatctgtgtg acgaacgagt ggttcttttt gttgttgttg ttgttgttga 
24601 agactgtgtg cacagaatca atgtttcttt gaaaaaataa ataaatttaa aaagccctag 
24661 gtcaagagab tgagaccrtc atggccaact tggtgsiaacc ccgtctctac taaaaaatcc 
24721 aaaaattagc tgggcatggt ggtgtgcgcc tgtagtccca gctactcggg aggctgaggc 
24781 agaagaattg cttgaactgg ggaggtggag gttgcagtga gccaagattg gtgcccctac 
24 841 actccagcct ggcaacagag ggagactcca tctcaaaaaa aaaaaaaaaa aaaaagtccc 
24 901 tgcactgatg ctgtgttggg gctggcttag tcccctcctg caatgctggc tggtcacatt 
24 9 SI cttactgtta gatgttggag gtcagggtcc cttgagggag gcaggcagga tgtaccaggt 
25021 gggatgttga gagcaaactg ttcaggttca ggagaggaca ctggtgctgg aggaggaggt 
25081 ctccctgtgc ccctgtacct cgtgggcccc gcatcctcct gcttgcacag cccgcctcag 
25141 ccaggagagg ccccgagccg ctccagcgtc cattcagccg gcagagtgga ccattcacct 
25201 gtgccagctc cactctaacg ttctctcaca gggcccaagg tcctgcccca gccatctccc 
2 5261 cctgatccafc cctccttccc ctcaggtttg ccaagtacta cagagacctg gctggcaacg 
25321 agcagcgcac gctcatcaag gcctatggca tccgcttcga catcattgtg tttgggaagg 
25381 tagctcgccg ccactggctc ccctccgtca ctccct'gcag ggacaagggg cctctccctg 
25441 cccctgcaga aacacttttt ttctttttcg gtgtcttggc aggcagggaa acttgacatc 
25501 atccccacta tgatcaACst: cggctctggc ctzggcactgc tagrgrca tggt gagtggttta 
25561 ggccctgcct tcaccctcac ggtgaggtga gaccctgggc tggggtcctg gtcctggccc 
25621 taggccctag acctcagatg tgtttctaaa cttgaccctc ctaccttctt tcccttggcc 
25681 GCcagcGctc ctcccacctt cccttctcca agaccacccc cctcaggtcc cagccttctc 
25741 ccaagagatg ggagtgcctt tccattccgg taaagattcc aggcttctca ggaaggggca 
25801 cgcaaagaat aagatgggtt gatgggttgc aagcatcctg gctcactctc accctatgct 
25861 aaactcaggrc gaccgtgctg tgtgacatca tagtcctcta ctgcatgaag aaaagactct 
25921 actatcggga gaagaaatat aaatatgCgg aagaCtacga g^caggtaggc ccctcctggc 
25981 ccccagcagg cacaggcctc tcatctcttg ggtgtgggag ccctgggcgt gggcctgtct 
26041 ggggaggccc ttctgcagag gctggcacca gtgtggcgtg gtgtccccgt taacccgggc 
26101 agtcctgcca ctctcagcag ctgctccatc cctaggcccc tgtactagat tgactcttta 
26161 aacccagcct cgtttcaatg arcgacatct caggttggtg atgataatgc atgctctgag 
26221 aatgcctgtg ggcacacact acttcagtgc accttgcgga acaggaaggg ttgggttcca 
2 6281 ggcctgggac caacttgaga acccctggcg gtgaagt ccc aggagcgcac ctccctcccg 
26341 cctgccacaa ggggtcccag gggcaccttg atctgcttgt gtccttcttt gcagggrtctt 
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26401 ^ctacftffagc tggaccagta acfgcctaccc caca cctargrgr ctctccacagr ccccatcaaa 
26461 gaacacfagag gaggaggagg gagaaatggc caccacatca ccccagagaa atttctggaa 
.26521 ZctgatHgag tictccactcc acaagcactc agggttcccc agcagctcct gtgtgCtgtg 
26581 tgcaggatct gtttgcccaa tcggoccagg agrgtcagcag tctgttcttg gctgggtcaa 
26641 ctctgctttt cccgcaacct ggggttgtcg ggggagcgct ggcccgacgc agtggcactg 
26701 ctgtggcttt cagggctgga gctggctttg ctcagaagcc tcctgtct cc agctctctcc 
2S761 aggacaggcc cagtcctctg aggcacggcg gctctgttca agcactttat gcggcagggg 
26821 aggccgcctg gcCgcagtca ctagacttgt agcaggcctg ggctgcaggc ttccccccga 
2 6881 ccattccctg cagccatgcg gcagagctgg catttctcct cagagaagcg ctgtgctaag 
26941 gtgatcgagg accagacatt aaagcgtgat tttcttaatc cctgtctgtt gtctcatagc 
27001 atgtgctaga actttccttc ctaccctttt acgacaacca gtaaatccac tattccggct 
27061 cactgtgctc agaaaaggtc catgggatgg tctgtttctg gcacttatgc acattttccc 
27121 ctacttcttt atttatttat tttatttttg aggcagagtc tcaccctgtc gcccaggctg 
27181 gagcgcagtg gtgcaatctc ggctcattgc aacctctgcc tcccgggttc aagcgattct 
27241 cctgcctcag cctcccgagt agctgggatt acaggtgcat gccaccatgc ccggctaatt 
273 01 tttgtatttt tagtagagac agggcttcac cacattggcc agccaggctg gtctcgaact 
27361 cctgacctca aatgatccay ctgccttagc ctctcaaagt gattggaCta caggtgtgcg 
27421 ccaccacacc tggcctattt atttttttta ttacccaggc tggagtgcag tggcgcaatc 
274 Bl actgctcact gcagtctcca cctcctgggc ttaagcgatc ctcctacctc agcctcccat 

27 541 agtgctggga ttacaagcat gagccactgc acccagctgc ctacttttta aagcctaata 
27601 atttgtcatt agcttgtctt cctgcacagt tcacaggaca ctttttctct agtgacatct 
27661 tacctgtgcc tccattttga cataaaatta caattctatg ctgtgggtaa cacaattttt 
27721 ggCaacatgg aagtggctbc ctctcccttc cctgtttgct tcccaaaagt ctccaaccaa 
27781 aagcagcagc acgcatacct agctccgcac aggttcgaaa tgttcatgaa gctatttcca 
27841 gctgcagctt cacaccaact aaatgcttgt gtttctctgt ttttgagaca gagtcttgcc 
27901 ctattgccca ggctggagtg cagtggcatg atctcggctc actgcaacct ctgcctcctg 
27961 ggttcaagca attctcctgc ctcagcctcc caagtagctg ggattacagg catctgccac 

28 021 cacacccagc taatttttgt atttctagta gagacagagg tccaccatgt tggccaggct 
28081 ggtctggaac tcctgacctc atgatceacc gcctcagcct cccaaagtgc tgggattaca 
28141 ggcgtgagcc actgcaccca gcc^ca.cacc agctgaattc tgcatcaggt attcagaatc 
2 82 01 cccctagagg aaqcagtttg cccttcgcct ctgccatgaa ttcacttcca agtactgaga 
28261 ggaaccatta ttttacaaat actctgcttc tgctggataa atggccaacc tgggcttttc 
2 8321 ttgcttttgg aaaaaaaaaa aaaaaaagtt taccaactcc ctttattgaa gctctggttc 
2B381 atgctgcctt ttaaaaaaaa atgcttgttg cctgtaatcc caggactttg ggaggccaag 
2B441 gtggatcayt taagcgtagg cattcaagac cagcccagtc aayatggtga gaccctatct 
2 8501 ctacaaaaaa ttaaattagc cgggtgtggc ggtatatatc tgtagtccca gctacttggg 
28561 tggctgaggc crgaggatca cttgagcqca ggaggttgag gctgcagtga gctatg 
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Ens, #181 
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ctctcctaaa qttqtgatag cattttctca qtggaaagga tgaccacaga attatctccc 

ctgggctgca actaaggaaq tatttggagt gaqaatguga »j;c:;dgyc::Ltca aatcr.aggag 
ctqygctqca actaaggaag tatttggagt qagaacgcga ccaggcttca aatecaggag 

ctgggctgca actaaggaag tatrtggaqt gagaatgcga ccaggcttca aatccagqaq 

gggtatttta caqggcgtqt qqctctcacc cagggctcac aggaciiirctg Htntttagqc 
gggtatttta cagggcqtgt gqcuctcacc caqqgctcac aqgacacctg dLaLLtagqc 

qqgtatttca caqggcgLqL ggctctcacc cagggctcac aqgacacctg atatttaqqc 

tgttccccaa gaqgctaagg acagcqcggc cccaycLccc ccaaccaciag ctqcccaagt 
tgttccccaa gaqgctaagq acagcqcqqc cccagctccc ccaaccacag ctgeccaagt 

tgttccccaa gaggctaagg ac^grrgcrggc cccaqctccc ccaaccacag ctqcccaaqt 

ttaggagccc qcactcqtqa gcqatggcta ctccgcacag cnccgggqtc ctctacggaa 
ttaggagccc gcactcgtga gcqatqqcta ctccgcacag ccccggyyLt: ctctacggaa 

ttaggagccc gcactcgtga gcgatggcta ctccgcacag ccccggggtc ctctacggaa 

atccctggca cctatcagta cccctaaggg aggtcgctcg ttacccctca cttgaactct 
atccctggca cctatcagta cccctaaggg aggtcgctcg ttacccctca cttgaactct 

atccctggca cctatcagta cccctaaggg aggtcgctcg ttacccctca cttgaactct 

qtcccaaatg ggaggtcact ctctcttcgc ggsccccatc ttccccgaaa qggtctcttq 
gtcceaaatg ggaggtcact ctctcttcgc ggaccccatc ttccccgaaa gggtctcttg 

gtcccaaatg ggaggtcact ctctcttcgc ggaccccatc ttccccgaaa gggtctcttg 

ttctctctgg ccctcctgag aaggtcatta ctccccaact cgcctttttt gttgtttttt 
ttctctctgg ccctcctgag aaggtcatta ctncccaact cgcctttttt gttgtttttt 

ttctctctgg ccctcctgag aaggtcatta ctccccaact cgcctttttt gttgtttttt 

ggtttctggt ttattgaggc agggtctcgc ttcatctccc agactqgagt goagtggcgc 
ggtttctggt ttattgaggc aqggtctcgc ttcatctccc agactggagt gcagtggcgc 

ggtttctggt ttattgaggc agggtctcgc ttcatctccc agactggagt gcagtggcgc 

gatudcagct cacacggcag cc tctacctc ctgggctcaa gcgatcctct cacctcagcc 
gatcacagct cacacggcag cctctacctc ctgggctcaa gcgatcctct cacctcagcc 

gatcacagct cacacggcag cctctacctc ctgggctcaa gcgatcctct cacctcagcc 

tccctgggac tacaggcgag cgccactacg oocgcttttt tttttttttt tttttttttt 
tccctgggac tacaggcgag cgccactacg eccgcttttt tttttttttt tttttttttt 

tccctgggac tacaggcgag cgccactacg eccgcttttt tttttttttt tttttttttt 

ttqtattttt gatagagacQ agqtctcqct atgttqtcca ggctggLctc taactcaaga 
ttgtattttt gatagagacg aggtctcqct atqttqtcca ggctggtctc taactcaaga 

ttqtattttt gatagagacg aggtctcqct atgttgtcca ggctggtctc taactcaaga 

ttcctctcac ctcggcctcc qcaaqtgctg ggatgacagq tgtgagccac cgcccccgqc 
ttcctctcac ctcggcctcc gcaaqtqctg ggatgacagg tgtgagccac cgcccccggc 

ttcctctcac ctcgyccLc«j gcaagtgctg ggatgacagq tgtgagccac cgcccccggc 

ccctcgcccg ccttttqaag gagcctttcg Lcctcaaggg cgaggccact ccccccccgc 
ccctcgcccg ccttttqaag qagcctttcg tcctcaaggg cgaggccact ccccccccqc 
ccctcgcccg ccttttgaag gagcctttcg tcctcaaggg cgaggccact ccccccccgc 

ccctcgcccg ccttttgdag gagcctttcg tcctcaaggg cgaggccact ccccccccgc 

gagttccatg ccccctagaq gqtcatcgtt cccgacgggg aggtggcgcc ctcccccsggg 
gagttccatg cCCCCtagag gqtcatcgtt cccgacgggg aggtggcgcc ctcei?ccggg 
gagLLecatg ccccctagaq ggtcatcgtt cccgacgggg aggtggcgcc ctcccccqgg 


gagttccatg ccccctagaq gyLcdtcgtt 


cgacgggg aggtgqcgcc ctcccccqgg 


ccccgggccc cgaccgcccg tqctgcctcc ttccgggccc tcctccgcga tgacggcgcc 

ccccgggccc cgaccgcccg tgctgcctcc ttccgggccc tcctccgcga tgacggcgcc 

ccccgggccc cgaccgcccg tgctgcctcc ttccgggccc tcctccqcga tgacggcgcc 

ccccgggccc cgaccgcccg tgctgcctcc ttccgggccc tcctccgcga tgacggcgcc 

gceagcaggc caggcggact gggcggggct ccgagcgggg acLyggaccc ^gaccgacta 

gccagcaggc caggcggact gggcggggct ccgagcgggg actgggaccc agaccgacta 

gceagcaggc caggcggact gggcggggct ccgagcgggg actgqqaccc agaccgacta 
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gccagcaqqc caggcggact ggqcggggcc ccgagcgggg actgggaccc agaccgacta 

ggggactgqg agcggycigyc gcggccatgg cgggctgctg cgccgcgctg gcggccttcc 
ggggactgqg agcgggcggc gcggccatgg cgqgctgctg cgccgcgctg gcggucLtcc 
ggggactggg agcgqqcqqc gcggccatgg cgggctgctg cgccgcgctg gcggccttcc 

gqggactggg agcgggcggc gcggccatgg cgggctgctg cgccqcggtg gcggccttcc 

tgttcgogta cgacacgccg cgcatcgtgc tcatccgcag ccgcaaagtg gggctcatga 
tgttcgagta cgacacgccg cgcatcgtgc tcatccgcag ccgcaaagtg gggctcatga 
tgttcgagta cgacacgccg cgcatcgtgc tcatccgcag ccgcaaagtg gggctcatga 

tgttcgagta cgacacgccg cgcatcgtgc tcatccgcag ccgcaaagtg gggctcatga 

accgcgccgt gcaactgctc atcctggcct acgtcatcgg gtgagcgtgg ggccgcqcgg 
accgcgccgt gcaactgctc atcctggcct acgtcatcgg gtgagcgtgg ggccgcgcgg 
accgcgccgt gcaactgctc atcctggcct acgtcatcgg gtgagcgtgg ggccgcgcgg 

accgcgccgt gcaactgctc atcctggcct acgtcal i-gg gtgagcgtgg ggccgcgcgg 

ggggcgcguci gggtgctgcc ctcgcgtccg cqccqtgcgg cggctcatcc tggcctcggt 
ggggcgcggc gggtgctgcc ctcgcgtccg cgccgtgcgg cggctcatcc tggcci <:gqL 
ggggcqcqgc gggtgctgcc ctogegtccg cgccgtgi:gg cggctcatcc tggcctcggt 

ggggcgcggc gggtgctgcc ctcigcgtccg cgccqtqcgg cggctcatcc tggcctcggt 

cacctgggcg agtccgggag cggcgagccg aggeggtgae acjcttccctg ggccccagcc 
cacctgggcg agtccgggag cggcgagccg aggcggtgac accttccctg ggccccagcc 
cacctgggcg agtccgggag cggcgagccg aggcggtgac accttccctg ggccccagcc 

cacctgggcg agtccgggag cggcgagccg aggcggtgac: accttccctg ggccccagcc 

gccgcgccgg ggccccgggg gcgggaggct gcttgctgct gttttaaaac cacagcctgg 
gccgcgccgg ggccCCgggg gcgggaggct gcttgctgct gttttaaaac cacagcctgg 
gccgcgccgg ggccccgggg gcgggaggct gcttgctgct gttttaaaac cacagcctgg 

gccgcgccgg ggccccgggg gcgggaggct gcttgctgct gttttaaaac cacagcctgg 

gccaggcgcg gtggcgcaag cctgtaatcc gagcactttg ggagcccgag gcgggaccat 
gccaggcgcg gtggcgcaag cctgtaatcc gagcactttg ggagcccgag >■ gcgggaccat 
gccaggcgcg gtggcgcaag cctgtaatcc gagcactttg ggagcccgag gcgggaccat 

gccaggcgcg gtggcgcaag cctgtaatcc gagcactttg ggagcccgag gcgggaccat 

cgcctgagcc caggagttcg aaaccaattt gggcaacatc gtgggaccee gtctctgcaa 
cgccrgagcc caggagttcg aaaccaattt gggcaacatc gtgggaccee gtctctgcaa 
cgcctgagcc caggagttcg aaaccaattt gggcaacatc gtgggaccee gtctctgcaa 

cgcctgagcc caggagttcg aaaccaattt gggcaacatc gtgggaccee gtctctgcaa 

aaaattaaaa atcatccggg cctggtqqcg cgcgcctggg gtcccagcta ctcaggaggc 
aaaattaaaa atcatccggg cctggtggcg cgcgcctggg gtcccagcta ctcaggaggc 
aaaattaaaa atcatccggg cctggtggcg cgcgcctggg gtcccagcta ctcaggaggc 

aaaattaHHH atcatccggg cctggtggcg cgcgcctggg gtcccagcta ctcaggaggc 

tgaggtqgga ggatcgcttg agcccaagag gctgaggt-lg cagtgagcta tgatcacacc 
tgaggtggga ggatcgcttg agc(:t:aagag gctgaggctg cagtgagcta tgatcacacc 
tgaggtggcja ggatcgcttg agcccaagag gctgaggctg cagtgagcta tgatcacacc 

tgaggtggga ggatcgcttg agcccaagag gctgaggctg cagtgagcta tgatcacacc 

acttaactcg agcctgggag acagtccgag acccgtctgt ttaeaaaata aataaaggca 
acttaactcg agcctggqag acagtccgag acccgtctgt ttaeaaaata aataaaggca 
acttaactcg agcctgggag acagtccgag acccgtctgt ttaeaaaata aataaaggca 

acttaactcg agcctgggag acagtccgag acccgtctgt ttaeaaaata aataaaggca 

aagcctggct aggctgttac cctctcctgc ctgacccccc atagtgcccc ttggctgtag 
aagcctggct aggctgttac cctctcctgc ctgacccccc atagtgcccc ttggctgtag 
aagcctggct aggctgttaiT cctctcctgc ctgacccccc atagtgcccc ttggctgtag 

aagcctggct aggctgttac cctctcctgc ctgacccccc atagtgcccc ttggctgtag 

gggcttcgcc ctgaacgtct caatcttcgc tctccacttc ccatcctqct gatttgctga 
gggcttcgcc ctgaacgtct caatcttcgc tctccacttc ccatcctgct gatttgctga 
gggcttcgcc ctgaacgtct caatcttcgc tctccacttc ccatcctgct gatttgctga 

gggcttcgcc ctgaacgtct caatcttcgc tctccacttc ccatcctgct gatttgctga 

caaaaccaaa gcttggaatt tccgcttgqt tccttctggg tgggtggcct ttggtcattt 
ceaaaecaaa gcttggaatt tccqcttggt tccttctggg tgggtggctt ttggtcattt 
caaaaccaaa gcttggaatt tccgcttggt tccttctgag tgggtggctt ttggtcattt 
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caaaaccaaa gcttggaatt tccqcttggt L<;cttctggg tqggtggctt ttggtcattt 

ctcctgatca ctaCguggtg tcacLatgtg gtpgtagcfja ggtcagacty L^igogagtgt 
cttttgatca ctatgcggtQ tcactatqtg ytagtagcga ggtcagactg tagfTgagtgt 
cttttgatca ctatcjcggtg tcactatgtg qtagtagcga ggtcagsctg tagcgagtqt 

cttttgatca ctatgcggtg tcactatgtg gtagtagcga ggtcagactq tagcgagtgt 

ttaaagtctg ctccctttgt cttctgggct tgtggggett tttglrggt.ac ctgccctagc 
ttaaagtttq cttcctttgt. tttctgggct tgtggggctt tttgtggtac ctgccctagc 
ttaaagtttg ctUcctttqt tttctqggr:t. tgtggggctt tttgtggtac ctqcwtagc 


taaagtttg cttcctttgt 


tgtggggctt tttgtggtac ctgccctagc 
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ctagtcagtc attccccatg ctgccccctt aggctagaga tgcc-ctaccg cectraggcc 
ctagtcagtc attccccatg ctgccccctt aggctagaga tgccctaccg ccctcaggcc 
cEagtcagtc attcccuatg ctgccccctt aggctagaga tgccctaccg ccctcaggcc 

ctagtcagtc attccccatg ctgccccctc aggctagaga tgccctaccg ccctcaggcc 

tcgctgaatg tgccattgta cttgaaggea cctgttactt ttttaatttt tttatttttt 
tcgctgaatg tgccattgta cttgaaggea cctgttactt ttttaatttt tttatttttt 
tcgctgaatg tgccattgta cttgaaggea cctgttactt ttttaatttt tttatttttt 

tcgctgootg tgccattgta cttgaaggea cctgttactt ttttaatttt tttatttttt 

tgagacggaa tttetctett gtcgnrcggg ctggagtgca atggcacgat ctcggcteac 
tgagacggaa tttetctett gtcgcccggg ctggagegca anggcacgat ctcggcteac 
tgagacggisa tttetctett gtcgcccggg ctggagt<ica atggcacgat ctcggcteac 

tgagacggaa tttetctett gtcgcccggg ctggagtgca atggcacgat ctcggcteac 

tgcaacctct gcctcccggg ttcarigcBat tctcctgtct ctsgtctcctg agtgqctggg 
tgcaacctct gcctcccggg ttcaagcaat tctcctgtct cagtctcctg agtggctgyg 
tgcaa?ctct gcctcccggg ttcaagcaat tctcctgtct cagtctcctg agtggctggg 

tgcaacctct gcctcccggg ttcaagcaat tctcctgtct cagtctcctg agtggctggg 

attacaagtg cccgccacca cgcccggcca stttttgtat tttttagtag agacggggtt 
attacaagtg cccgccacca cgcccggcca atttttgtat tttttagtag agacggggtt 
attacaaqtq cccgccacca cgcccggcca atttttgtat tttttagtag agacggggtt 

attacaagtg cccgccacca cgcccggcca atttttgtat tttttagtag agacggggtt 

tcaggctggt cctgaactga cctcaggtga tccgcccacc tcagcctccc aaaatgctgg 
tcaggctggt cctgaactgn cctcaggtga tccgcccacc tcagcctccc aaaatgctgg 
tcaggctggt cctgaactga cctcaggtga tccgcccacc tcagcctccc aaaatgctgg 

tcaggctggt ect;^aactga cctcaggtga tccgcccacc tcagcctccc aaaatgctgg 

qattacaggc gtgagccccc acgcggggct ggcacctgtt acaattgaat aeaacgcgag 
gattacagge gtgagccccc acgcggggct ggcacctgtt acaattgaat aeaacgcgag 
gattacaggc gtgagccccc acgcggggct ggcacctgtt acaattgaat aeaacgcgag 

gattacagge gtgagccccc acgcggggct ggcacctgtt acaattgaat aeaacgcgag 

agagaagatg ataattaoct rgccacctgg ggccacttta aaggcttacc tqgggtgttc 
agagaagatg ataattacct tgccncctgg ggccacttt^i aaggcttacc tqgggtgttc 
agagoogotg ataattacct tgccacctgg ggccacttta aaggcttacc tggggtgtLc 

agagaagatg ataattacct tgccacctgg ggccacttta aaggcttacc tqgggtgttc 

tqqtgtctac agctgctggg ctgacacgta ttgcctagtg caqtggtctc rsaactgeac 
tggtgtctdc agctgctggg ctgacacgta ttgcctagtg cagtggtctc caaactgcac 
tggtqtctac agctgctggg ctgacacgta ttgeetagtg cagtggtctc caaactgcac 

tqgtgtctac agctgctqqg ctgacacgta ttgcctagtg cagtggtctc caaactgcac 

dttqgaacca tctggaaatc tttggctccc atccccaqac ccttttttat ttttttgatg 
attggaacca toLgqaaatc tttggctccc atccceagac ccttttttat ttttttgatg 
attqgaacca tctggaaatc tttggctccc atccceagac ccttttttat ttttttgatg 

attggaaccs tctggaaatc tttggctccc atccceagac ccttttttat ttttttgatg 

yagtetccct ctgtcgccca ggctggaqtg cagtggtgca atcttgqctc gctgcaacct 
gagrctccct ctgtcgccca ggctggaqtg cagtggtgca atcttggctc gctgcaacct 
gagtctcect ctgtcgccca ggctggagtg ^dgtggtgca atcttggctc gctgcaacct 

gaqtctccct ctgtcgccca ggctggaqtg cagtggtgcs atcttggctc gctgcaacct 

ccacctcccc ggttcaagcq attctcctgc ctcagcctcc tgagtagctg ggattacagg 
ccacctccct ggttcaagcg attctcctgc ctcagcctcc tgagtagctg ggattacagg 
ccacctccct ggttcaagcg attctcctgc ctcagcctcc tgagtagctg ggattacagg 
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cacnt:Ht:(jdK catgcccggg ctaacttttq tatttttagt agggacagyg tttcgccatg 
cacGcaocac catgccc-gg ctaattcttg tatttntagt agggacaggg tttcgccatg 
cacccaccac catgcccggg ctaatLtttg tatttttagt agggacaggg tttcgccatg 

cacccaccac' catgcccggg ctaatttttg tatttttagt agggacaggg tttcgccatg 

ttggccayga tggtctcgaa ctcctqgcct ciaggtgatcc acccgccgca gnntcccaga 
ttggccaggc tggtctcgaa ctcotggcct caggtqatcc acccgccgca gcctcccaga 
ttggccaggc tggtctcgaa ctcctggcct caggtgatcc acccgccgca gcctcccaga 

ttggccaggc tggtctcgoo ctcotggcct caggtgatcc «f:ccgccgca gcctcccaga 

gtgctgggat tacaggcgtg agccaccgcg cccagacgca tccccagacc ttctgattta 
gtg^Lgggat tacaggcgtg agccaccgcg cccagacgca tcc-c.cagacc ttctgattta 
gtgctgggot tacaggcgtg agccaccgcg cccagacgca tccccagacc ttctgattta 

gtgctggy^t tacaggcgtg agccaccgcg cccagacgca tccccayrtco ttctgattta 

attgctgcag agtatgatcg gctgcccagg tgattctcat gttcagcaac gtttcggaac 
attgctgcng agtatgatcg gctgcccagg tgattctcat gttcagcaac gtttcqgaac 
attgctgcag agtatgatcg gctgcccagg tgattctcat gttcagcaac gtttcggaac 

attgctgtaig agtatgatcg gctgcccagg tgattctcat gttcagcaac, gtttcggaac 

caatgaactg tgtttcccaa gtttacctga tcatggaaga ctcatctgag gaattgttta 
caatgaactg tgtttcccaa gtttacctga tcatggaaga ctcatctgag gaattgttta 
caatgaactg tgtttcccaa gtttacctga tcatggaaga ctcatctgag gaattgttta 

caatgaactg tgtttcccaa gtttacctga tcatggaaga ctcatctgag gaattgttta 

aaagtaacag atccccccgt ctttggggtg aggtttggaa tcttaacact gcccaaggga 
aaagtaacag atccccccgt ctttggggtg aggtttggaa tcttaacact gcccaaggga 
aaagtaacag atccccccgt ctttggggtg aggtttggaa icttaacict gcccaaggga 

aaagtaacag" atccccccgt ctttggggtg aggtttggaa tcttaacact gcccaaggga 

agcccaagca ggtttggctt ccaatatctg tcagcagcag cagtagatac agacagtatt 
agcccaagca ggtttggctt ccaatatctg tcagcagcag cagtagatac agacagtatt 
agcccaagca ggtttggctt ccaatatctg tcagcagcag cagtagatac agacagtatt 

agcccaagca ggtttggctt ccaatatctg tcagcagcag cagtagatac agacagtatt 

ceagaggatg gctttccagg tgcagactca acatcagaac cacctggccc acaoccagat 
caagaggatg gctttccagg tgcagactca acatcagaac cacctggccc acacccagat 
caagaggatg gctttccagg tgcagactca acatcagaac cacctggccc acacccagat 

^agaggatg' gctttccagg tgcagactca acatcagaac cacctggccc acacccagat 

ttggactcat tggctcacac ctgtagtccc agtgctttgg gaagccaagg tgggaqgatt 
ttggactcat tggctcacac ctgtagtccc agtgctttgg gaagccaagg tgggaggatt 
ttggactcat tggctcacac ctgtagtccc agtgctttgg gaagccaagg tgggaggatt 

ttggactcat tggctcacac ctgtagtccc agtgctttgg gaagccaagg tgggaggatt 

gcttgaggcc aggaqctcaa gaccagcctg gacacaagac cagcctggac aacatagcqa 
gcttgaggcc aggaqctcaa gaccagcctg gacacaagac cagcctggac aacstagcga 
gcttgaggcc aggagctcaa gaccagcctg gacacaagac cagcctggac aacatagcga 

gcttgaggcc aggaqctcaa gaccagcctg gacacaagac cagcctggac aacatagcga 

gaccccatct ctaaqagaaa aataaagttt aaaatggtaa attytactga tggagcagtg 
qaceccatct ctaagagaaa aataaagttt. aaaatggtaa attgtactga tggagcagtg 
gaccccatct ctaagagaaa aataaagttt aaaotggtaa attqtactga tggagcagtg 

gacceoatct* ctaagagaaa 'aataaa aaaatggtaa attgtactga tggagcagtg 

caaaaaaaag aatttttttt tttttttttt ctgaacacgg tggctcatgc ctataatctc 
caaaaaaaag aatttttttt tttttttttt crg^acacgg tggctcatgc ctataatctc 
caaaaaaaag aatttttttt tttttttttt ctgaacacgg tggctcatgc ctataatctc 

caaaaaaaag aatttttttt tttttttttt ctgaacacgg tggctcatgc ctataatctc 

agcactttgg gaggccgagg cgggtqgatc acctgaggtc aggagttcga gaccagcctg 
agcactttgg gaggccgagg cgggtqgatc acctgaggtc aggagttcga gaccagcctg 
agcactttgg gaggccgagg cgggtggatc acctgaggtc aggagttcga gaccagcctg 

agcactttgg gaggccgagg cgggtggatc acctgaggtc aggagttcga gaccagcctg 

gccaacatgg taaaaccccg tctctactaa aagtacaaaa aattagccag gcctagtggc 
gccaacatgg taaaaccccg tctctactaa aagtacaaaa aattagccag gcctagtggc 
gccaacatgg taaaaccccg tctctactaa aagtacaaaa aattagccag gcctagtggc 
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>Partl of Ens. #2881 
>NCBI t>29 #5641 
>NCBI old *5641 


qccaacatgq taaaaccccg tctctactaa aoqtacaaas aattagccag gcctagtg^c 

taatgcctgt aatcccg^ct acatggqagg ctg^ggcagg agaakcgttt gaatccagga 
Ldatgcctgt aatcccggct acatgygagg cty^ggcagg agaaCcgttt gaatccagga 
taatgcctgt aatcccggct acatgggagg ctgaggcagg agaatcgttt gaatccagga 

taatgcctgt aatcccggct acatgggagg ctgaggcagg agaatcgttt gaatccagga 

ggcggaggtt gcagtgagcc cagattgcgc cactgcactc cagcctgggc aacaagagca 
ggcggaggtt gcagtgagcc cagattgcgc cactgcactc cagcctgggc aacaagagca 
ggcggaggtt gcagtgagcc cagattgcgc cactgcactc cagcctgggc aacaagagca 

ggcggaggtt gcagtgagcc cagattgcgc cactgcactc cagcctgggc aacaagagca 

aaactccatc tcaaatatat atacatacat atscatstat atatatattt tttttttttt 
aaactccatc tcaaatatat atatatatat atatatatat atatatattt tttttttttt 
aaactccatc tcaaatatat atatatatat atatatatat atatatattt tttttttttt 

aaactccatc tcaaatatat atatatatat atatatatat atatatattt tttttttttt 


>Partl of Ens. #2941 
>NCBI b2 9 #5701 
>NCBI old «5T01 


>PartI of Ens. 83001 
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>HCfiI old #57€1 


>Partl of Ens. #3061 
>NCBI b29 #5821 
>HCBI old #5821 
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>NCBI old #5941 
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>HCBI b29 ttSOOl 
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>Partl of Ens.»3'121 
>HCBt b29 tt€iei 
>W:BI old »6181 


>Partl of Ens. HaiSl 
>NCBI b29 H6241 
>HCBI old . It€24 1 


>pai:tl of Ens. f»35ll 
>NCBI b29 B6301 
>NCBT old (t6301 


tttggagact cactgctagc cagttaactg aactcacatc cctggttgga cctgaagcat 
tttggagact cactgctagc cagttaactq aactoacatc cctggttgga cctgaagcat 
tttggagact cantgctagc cagttaactg a,act(2rtr.atc cctggttgga cctgaagcat 

tttggagact cactgctagc cagttaacLg aactcacatc cctggttgga cctgaagcat 

ttgggggcag gagctcccct gcagcttttt gcttttggcg tttgctgcta gtgggaggat 
ttgggggcag gagctcccct gcagcttttt gcttttggcg tttgctgcta gtgggaggat 
ttgggggcag gagctcccct gcagcttttt gcttttggcg tttgctgcta gtgggaggat 

ttgggggcag gagctcccct gcagcttttt gcttttggcg tttgctgcta gtgggaggat 

caqtgcacat ctaccttagg gtacctgtaa aattacaact gcggaaaata tccatagtgg 
cagtgcacat ctaccttagg gtacctgtaa aattacaact gcggaaaata tccatagtgg 
cagtgcacat ctaccttagg gtacctgtaa aattacaact gcggaaaata tccatagtgg 

cagtgcacat ctaccttagg gtacctgtaa aattacaact gcggaaaata tccatagtgg 

tgttgcactg tatggtagcc actagtaact tgcacctgtt gaacacttaa ccgttcctac 
tgttgcactg tatggtagcc actagtaact tgcacctgtt gaacacttaa- ccgttcctac 
tgttgcactg tatggtagcc actagtaact tgcacctgtt gaacacttaa ccgttcctac 

tgttgcactg tatggtagcc actagtaact tgcacctgtt gaacacttaa ccgttcctac 

tctacactga gatgtgctat aaccttaaaa tacacactgg atttcaaaga catagtacag 
tctacactga gatgtgctat aaccttaaaa tacacactgg atttcaaaga catagtacag 

tctacactga gatgtgctat aaccttaaaa tacacactgg atttcaaaga catagtacag 

tctacactga gatgtgctat aaccttaaaa tacacactgg atttcaaaga catagtacag 


aa;magaggc 
aaaaagaggc 
aaaaagaggc 


tgggcacagt 
tgggcacagt 
tgggcacagt 


ggctcatgcc 
ggctcatgcc 
ggctcatgcc 


tgaaat.caca gtactttggg aggcCgaggt 
tgaaatcaca gtactttggg aggctgaggt 
tgaaiitcaca gtactttggg aggctgaygt 


aaaaagaggc tgggcacagt ggctcatgcc tgaaatcaca gtactttggg aggctgaggt 


gggaagatcg 
gggaagatcg 
gggaagatcg 


cttgagctca 
cttgagctca 
cttgagctca 


agagtttgaq 
agagtttg^g 
agagtttgag 


acaaacctgg gcaacatagc gagaccttgc 
acaaacctgg gcaacatagc gagaccttgc 
acaaacctgg gcaacatagc gagaccttgc 


gggaagatcg cttgagctca agagtttgag acaaacctgg gcaacatagc gagaccttgc 


ctctacaaaa 
ctctacaaaa 
ctctacaaaa 


aaatacaaaa 
aaatacaaaa 
aaatacaaaa 


attogcctgq 
attagcctgg 
attagcctgg 


catggtggca ca cgcctata ggcctaccta 
catggtggca cacgcctata ggcctaccta 
catggtggca cacgcctata ggcctaccta 


ctctacaaaa aaatacaaaa attagcctgg catggtggca cacgcctata ggcctaccta 


tttgggaggt 

tttggyaggt 
tttgggaggt 


tgaggcagaa 

tgaggcagaa 
tgaggcagaa 


ggactgcttg 

gyactgcttg 
ggactgcttg 


aacctgggag qttgaggcca cagagagctg 
aacctgggag qttgaggcca cagagagctg 
aacctgggag gttgaggcca cagagagctg 


tttgggaggt tgaggcagaa ggactgcttg aacctgggag gttgaggcca cagagagctg 


tgatcacacc 

tgatcacacc 

t gat cacacc 


accctccagc ctgggcaaca gagcaagaca ctgcctcaaa aaaacaaaga 
sccctccagc ctgggcaaca gagcaagaca ctgtctcaaa aaaacaaaga 
accctccagc ctgggcaaca gagcaagaca ctgtctcaaa aaaacaaaga 


tgatcacacc accctccagc ctgggcaaca gagcaagaca ctgtctcaaa aaaacaaaga 

gtaccactat cacatgaaaa catttcttat tgatgacatg ttaaaatatt ttggatatat 
gtaccactat cacatgaaaa cattttttat tgatgacatg ttaaaatatt ttggatatat 
gtaccactat cacatgaaaa catttcttat tgatgacatg ttaaaatatt ttggatatat 
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gtaccactat cacacgaaaa csctttccat tgatgacacg ttaaaatatt ttggciLatat 

tgggtracac agaaCatatc attacaatta atttttcctg tctctttttt ctt^ittttct 
tggqctacac aqaacstdLc aLtar:aatta atttttcctg tctctttttt ctttttvtct 
tgggttacac ag^iatatatc attacaatCa atttttcctg tctctttttt cttLtttCct 

tgyijLLacac agarftatatc attacaatta atttttcctg tctctttttt ctttttttct 

ttttaaactig cqccttgcLt LcjLcacccag acLcjgaqtgc aatijcactqc aacctctgcc 
ttttaaacgg cgcfittgctt tgtcacccag actggagtgc aatgcactqc aacctctgcc 
tcctsaacgg cgccttgctt tgtcacccag accggagtgc aatgcactgc aacccctgcc 

ttttaaacgg cgccttgctt tgtcaoccag actggagtgc aatgcactgc aacctctgcc 

tcccaggctc aagtgattct accatct-cag cctctggaat batgggacta caggf.gcaca 
tcccaggiirtc aagtgattct accatctcag cctccggaat tatgggacta caggcgcaca 
ticccaggcLc aagtgattc^t accatctcag cctctggaat tatgggacta caggtgcaca 

tcccaggcrc aagtgattcc accatctcag cctctggaat CatQggacta caggtgcaca 

ccaccatgac tggctaatto ttgtgttttt gtttgtttgt ttgtacagat gagaaatctc 
ccaccatgac tyyctaatta tLgtgttttt gtLtgtttgt ttgLacagat gagaaatctc 
ccaccatgac tggctaatta ttgtgttttt gtttgtttgt ttgtacagat gagaaatctc 

ccaccatgac tggctaatta ttgtgttttt gtttgtttgt ttgtacagat gsgaaatctc 

atctctacaa tcatgaqijtt tccatgttgc ccaggctggt ctcyaactcc tgagctctag 
atctctacaa tcatgaggtt tccatgttgc ccaggctggt ctcgaactcc tgagctctag 
atctctacaa tcatgaggtt tccatgttgc ccaggctggt ctcgaactcc tgagctctag 

atctctacaa tcatuaggtt tccatgttgc ccaggctggt ctcgaactcc tgagctctag 

caat.ctgtct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgtgccca 
caatctgtct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgtgccca 
caatctgtct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgtgccca 

caatctgtct gcctcggcct cccaaagtgc tgggattaca ggcatgagcc. accgtgccca 

gccctttcta tttcttttta ctttttgaca tggctactag aaacttttaa aLLacagggt 
gccctttcta tttcttttta ctttttgaca tggctactag aaacttttaa .attacagggt 
gccctttcta tttcttttta ctttttgaca tggctactag aaacttttaa attacagggc 

gccctttcta tttcttttta ctttttgaca tggctactag aaacttttaa attacagggt 

cttqccctgt ggtagctcac gcctgtaa.tc ccaatacttt gggaggctga ggcaagtgga 
Gttgccctgt ggtagctcac gcctgtaatc ccaatacttt gggaggctga ggcaagtgga 
cttgccctqt ggtagctcac gcctgtaatc ccaatacttt gggaggctga ggcaagtgga 

cttgccecgt ggtagctcac gcctgtaacc ccaatacttt ggforaggctga ggcaagtgga 

tcccttgagc ttaggaattt tgagaccagc ctgggcaacg tagtgaaacc ccatctctac 
tcccttgagc ttaggaattt tgagaccagc ctgggcaacg tagtgaaacc ccatctctac 
tKccttgagc ttaggaattt tgagaccagc ctgggcaacg tagtgaaacc ccatctctac 

tcccttgai^c ttaggaattt tgagaccagc ctgggcaacg tagtgaaacc ccatctctac 

caaaaatacg aiiaaattagc ttgycatagt ggcgcacacc tgLggaccca gctacttggg 
caaaaatacg aaaaattagc ttggcatagt ggcgcacacc tgtggaccco gctacttggg 
caaaaatacg aaaaattagc ttggcatagt ggcgcacacc tgtggaccca gctacttggg 

caaaaatacg aaaaattagc ttggcatagt ggcgcacacc tgtggaccca gctacttggg 

aggctgagqt gggaggattg ctgtagcctg ggaggcggag gttgcagtga gccaagattg 
aggctgaggt gggaggattg ctgtagcctg ggaggcggag gttgcagtga gccaagattg 
aggctgaggt gggaggattg ctgtagcctg ggaggcggag gttgcagtga gccaagattg 

agyctgaqqt gggaggattg ctgtagcctg ggaggcggag gttgcagtga gccaagattg 

tgccactgca cttcagcctg qgcgacagag cgagagcctg tctcaaaaaa aaa^atatgt 
tgccactgca cttcagcctg ggtrgacagag cgagagcctg tetcaaaaaa aaaaatatgt 
tgccactgca cttcagcctg ggcgacagag cqagagcctg tctcaaaaaa aaaaaratgt 

tgccactgca cttcagcctg ggcgacagag cgagagcctg tctcaaaaaa aaaaatatgt 

ggctcgcgtt atqtttttat gqgacagcgc aqctctgtag gatfcccttat gacgca ttca 
ggctcgcgtt atgtttttat gggacagcgc agctctgtag gattccttat qacqcattca 
ggctcgcqct atgtttttat gggacagcgc agctctgtag gattccttat gacgcattca 

ggctcgcgtt atgtttttat gggacagcgc agctctgtag gattccttat gacgcattca 

ctatctcggt catgagaatc tttaaagcag taaatgcttt gcttttgtcc gtcttgctat 
etatetcggt catgagaatc tttaaagcag Laaatgcttt gcttttgtcc gtcttgctat 
ctatctcggt catgagaatc tttaaagcag taaatgcttt gcttttgtcc gtcttgctat 
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ctatctcggt catgagaatc tttaaagcag taaatgcttt gcttttgtcc qtcCtgctat 

aatcaaaaci- gact^caggac ctgcattaag qctctttttt tttttaacct catctcttaa 

aatcaaaacr gactcaggac ctQcaLLaag QCtctttt:tt tttttaacct catctcttaa 

aatcaaaacL gaetcaggac ctgrattaacj gcLcttttLt ttt-t.taacct ratctctl:na 


aatcaaaact gactcaggac ctgcattaag gctctttttt tttttaacct catctcttaa 

ctttttgatg caactcatct tagtgaaatt gtg^tctcct cctacagcaa ccctccctca 
ctttttgatg caactcatct tagtgaaatt gtqgtctcct cctacagcaa ccctccctca 
ctttttgatg caactcatct tagtgaaatt qtggtctcct cctacagcaa ccctccctca 


ctttttgatg caactcatct tagtgaaatt qtggtctcct cctacagcaa ccctccctca 

aaaaaagacc gaacacgcaa ttctcatcta tctaactctg ttaacaatca tatgaggagg 
aaaaaagacc gaacacgcaa ttctcatcta tctaactctg ttaacaatca tatgag^agg 
aaaaaagacc gaacacgcaa ttctcatcta tctaactctg ttaacaatca tatgaggagg 


aiiaaaagacc gaacaogcaa ttctcatcti^ tctaactuLg ttaat^aatca t-.atgaggagg 

aaataatcac atgataatga gCggcatgaa accctgtgtc tctaaattaa gcaactcact 

aaataatcac atgataatga gtggcatgaa accctgtgtc tctaaattaa gcaactcact 

aaataatcac atgataatga gtggcatgaa accctgtgtc tctaaattaa gcaactcact 


aaataatcac aCgataatga gtggcatgaa accctgtgtc tctaaattaa gcaactcact 

gtacttgatg agtacatcag tcttcctgcc ccaagctttt gtgtgtcaga ctggaattaa 
gtacttgatg agtacatcag tcttcctgcc ccaagctttt gtgtgtcaga ctggaattaa 
gtacttgatg agtacatcag tcttcctgcc ccaagctttt gtgtgtcaga ctggaattaa 


gtacttgatg agtacatcag tcttcctgcc ccaagctttt gtgtgtcaga ctggaattaa 

atggaagctt acactgcagc tgaattccta gaagccctgg acttgtcagg ggaaaagccc 
atcjgaagctt acactgcagc tgaattccta gaagccctgg acttgtcagg ggaaaagccc 
at-ggaagctt acactgcagc tgaattccta gaagccctgg acttgtcagg ggaaaagccc 


atggaagctt acactgcagc tgaattccta gaagccctgg acttgtcagg ggaaaagccc 

atgggggqaa agcccatctc taggctgggc aatgaacttg gagactgctc gatcacacac 
atggggggaa agcccatctc taggctgggc aatgaacttg gagactgctc gatcacacac 
atggggggaa agcccatctc taggctgggc aatgaacttg gagactgctc gatcacacac 


atqgggggaa agcccatctc taggctgggc aatgaacttg gagactgctc gatcacacac 

sctcataagt aacaaaacca ttastttctt ttcttttttt tggtggaggg acagagtctc 

actcataagt aacaaaacca ttaatttctt ttcttttttt tggtggaggg acagagtctc 

actcataagt aacaaaacca ttaatttctt ttcttttttt tggtggaggg acagagtctc 


Fir:tcataagt aaciaaaacca ttaatttctt ttcttttttt tggtggaggg acagagtctc 

actcttgc(jc aggctijgagt gcagtggcat gatctcFigcc cactgcaatc t.ccacctncc 
aetcttgccc aggctggagt gcagtggcat gatctcagcc cactgcaatc tccacctccc 
actcttgccc aggctggagt gcagtggcaL gatctcagcc cactgcaatc Lccacctccc 


aetcttgccc aggctggagt gcagtggcat gatctcagcc cactgcaatc tccacctccc 

qggttcaagt gattctcctg cctcagcctc ccgagtatgt ggqactacag gcacccacca 
ggqttcaagt gattctcctg cctcagcctc ccgagtatgt gggactacag gcacccacca 
gggttcaagt gattctcctg cctcagcctc ccgagtatgt gggactacag gcacccacca 


gggttcaagt gattctcctg cctcagcctc ccgagtatgt gggactacag gcacccacca 


ccacacccag 
coacacccag 
ccacacccag 


ctaattttgt 
ctasttttgt 
ctaattttgt 


gtatttttag 
qtatttttaq 
gtatttttag 


tagagtcggg 
taqagtcggg 
tagagtcggg 


gtttcacaat 
gtttcacaat 
gtttcacaat 


gttggccagg 
gttggccaqg 
gttggccagg 


ccacacccag ctaattttgt gtatttttag tagagtcggg gtttcacaat gttggccagg 

ctggtcttgg actcctgacc tca<icaaa.ac cdgtgaattL ctgaagLctc agt;atctagg 

CCggtcctgg actcctgacc tcaacaaaac cagtgaattt ctgaagtctc agcatctagg 

ctggtcttgg actcctgacc tcaacaaaac cagtgaattt ctgaagtctc agcatcjtagg 


ctqgtcttgg actcctgacc tcaacaaaac cagtgaattt ctgaagtctc agcatctagg 

acttagaaga aggagaagcc tagcatttcc tqattctata aataacaaca ctgatagtct 
acttagaaga aggagaagcc tagcatttcc tgattctata aataacaaca ctgatagtct 
acttagaaga aggagaagcc tagcatttcc tgattctata aataacaaca ctgatagtct 


ACttagaaga aggagaagcc tagcatttcc tgattctata aataacaaca ctgatagtct 

ctagtagaty tttaaci^gca gttcgaaggg qaggggtlict gattLgtagt aLjLggccaat 

ctagtagatq tttaacagcs gttcjgaaggg gagofggtLct gattLgtagt acjLggccaat 

ctagtagatg tttaacagca gttcgaaggy qaggggLtct gatLtgtagt actggccaat 
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>NCBI old #8821 


ctagtagacg cttaacagca gttcgaaggg g.*ggggttct gatttgtagt actgqccaat 

ttccacggtq tcagtacttg ctctqtgacc aacgacgdCt cacaaacatt gttgtaaaca 
ttccacggtq tcagtacttg ctctgtgacc aacgatgact cacaaacatt gttgtaaaca 
tcccacggcg tcagtacttg ctctgtgacc aacgatgacb cacaaacatt gttgtaaaca 

ttccacggtg tcagtacttg ctctgtgacc aacgatgact cacaaacatt gttgtaaaca 

tttgacctac attaattctt ttaccctcac aagaatcctg tgaagatcta ctatcattcc 
tttgacctac attaattctt ttaccctcac aagaatcctg tgaaqatcta ctatcattcc 
tttgacctac attaattctt ttaccctcac aagaatcctg tgaagatcta ct at cat tec 


tttgacctac atta 


ttaccctcac aagaatcctg tgaagatcta ctatcattcc 


cagattatag atgqgoaaaa ctgaggcaca atgaggttaa gcaccttggc caaagttatg 
caqattatag atgggaaaaa ctgaggcaca atgaggttaa gcat-f.:! tggc caaagttatg 
cagattatag atgggaaaaa ctgaggcaca atgaggttaa gcaccttggc caaagttatg 

cagattatag atgggaaaaa ctgaggcaca atgaggttaa gcaccttggc caaagttatg 

cctcgtgaat ggtggagctg agccatgaac caagacctct gcctccagtc tgtcctaacc 
cctcgtgaat ggtggagctg agccatgaac caagacctct gcctccagtc tgtcctaacc 
cctcgtgaat ggtggagctg agccatgaac caagacctct gcptccagtc tgtcctaacc 

cctcgtgaat ggtggagctg agccatgaac caagacctct gcctccagtc tgtcctaacc 

cctataagct aatgcctgta gatggttttt tttttttttt tttttttgag atMgagcctt 
cctataagct aatgcctgta gatggttttt tt.Ll.LLtttt tttttttgag acagagcctt 
cctataagct aal.gcctgta gatggttttt tttttttttt tttttttgag acagagcctt 

cctataagct aatgcctgta gatggttttt tttttttttt tttttttgag acagagcctt 

gccctgtcgc ccaggctgga gtgcggtggc atcacagctc actgcagcct ccgtctccca 
gccctgtcgc ccaggctgga gtgcggtggc atcacagctc actgcagcct ccgtctccca 
gccctgtcgc ccaggctgga gtgcggtggc atcacagctc actgcagcct ccgtctccca 

gccctgtcgc ccaggctgga gtgcggtggc atcacagctc actgcagcct ccgtctccca 

ggctcaagtg atcctcccac ctcagcctcc caaatagccg ggatttttat tggacagcac 
ggctcaaqtg atij<:l.ccr:ac ctcagcctCC caaatagccg ggatttttat tggacagcac 
ggctcaagtg atcctcccac ctcagcctcc caaatagccg ggatttttat tggacagcac 

ggctcaagtg atcctcccac ctcagcctcc caaatagc:r:g ggatttttat tggacagcac 

^gctctgtag gattccttat gacgtattca ctgtctcagt cacgagaatc tttaaagcag 
agctctgtag gattccttat gacgtattca ctgtctcagt cacgagaatc tttaaagcag 
agctctgtag gattccttat gacgtattca ctgtctcagt cacgagaatc tttaaagcag 

agctctgtag gattccttat gacgtattca ctgtctcagt cacgagaatc tttaaagcag 

taaatgcttt gcttttgtcc atcttgctHt aatcaaaact gaccatgccc ggctaatttt 
taaatgcTtt I. gcttttgtcc atcttgctat aatcaaaact gaccatgccc ggctaatttt 
taaatgcttt gcttttgtcc atcttgctat aatcaaaact gaccatgccc ggctaatttt 

taaatgcttt gcttttgtcc atcttgctat aatcaaaact gaccatgccc ggctaatttt 

ttattttttg tagagatccc tatgttgccc aggcttgtct cgagctcttg ggcacaaatg 
ttattttttg tagagatccc tatgttgccc aggcttgtct cgagctcttg ggcacaaatg 
ttattttttg tagagatccc tatgttgccc aggcttgtct cgagctcttg ggcacaaatg 

ttattttttg tagagatccc tatgttgccc aggcttgtct cgagctcttg ggcacaaatg 

atcttcccci! [jttgacctcc caaagttctg ggattacagg cqtgagccac tgtacccagc 
atcttccccc cttgacctcc caaagttctg ggattacagg cgtgagccac tgtacccagc 
atcttccccc cttgacctcc caaagttctg ggattacagg cgtgagc;t:Kic tgtacccagc 

atcttctrccc: cttgacctcc caaagttctg ggattacagg cgtgagccac tgtacccagc 

ccctttggat ttttaattgt tcaaactggt agtcatagtc ccacttcaga ggagagaacc 
cQctttggat ttttaattgt tcaaactggt agtcatagtc ccacttcaga ggagagaacc 
ccctttggat ttttaattgt tcaaactggt agtcatagtc ccacttcaga ggagagaacc 

ccctttggat ttttaattgt tcaaactggt agtcatagtc ccacttcaga ggagagaacc 

atatatgtga ccagaattct tgggggaaaa gtaagaaact actcttaaaa gttccttaaa 
atatatgtga ccagaattct tgggggaaaa gtaagaaact actcttaaaa gttccttaaa 
atatatgtga ccagaattct tgggggaaaa gtaagaaact acLcttaaaa gttccttaaa 

atatatgtya ccagaattct tgggggaaaa gtaagaaact actcttaaaa gttccttaaa 

ggqtagaatt ctctttteta qaaagagtag cctagggttg tccisatcttt tggctttcgt 
gggtagaatt ctctttteta gaaagagtag cctagggttg tccaatettt tggctttcgt 
gggtagaatt ctctttteta gaaagagtag cctagggttg tccaatettt tggctttcgt 
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ggqtagaatt ctcLLULcfca gaaagagtag cctagggttg tccaatctct tggctttcgt 

ggqccscar.t. qgaaggaqaa attatcttgg gccacacatt aaatatgcta acactaacaa 
gggccacact qgaaggaqaa attatcttgg qccacacatc aaatatgcta acactaacaa 
gggccacact qgaaggaqaa attatcttgg gccacacott aaatatgcta acactaacaa 

gggccacact ggaaggogaa attatcttgg gccacacatt aaatatgcta acactaacaa 


tagctgatga gctaaaaaac asactttttt tgatcacaca aaa^ 
tagctgatga gctaaaaaac aaactttttt tgatcacaca aaa. 
tagctgatga gctaaaaaac aaactttttt tgatcacaca aaa. 


ate tcataacqtt 
ate tcataacqtt 
ate Lcj^Laacgtt 


tagctgatga gctaaaaaac aaactttttt tgatcacaca aaaaaaaatc tcataacgtt 

ttaagaaagt ttacgaattt qtgttgggcc tcattcaaag ccatcctgqg ccgcatycgg 
ttaagaaagt ttacgaattt gtgttgggcc tcattcaaag ccatectggg ccqeatgcgg 
ttaagaaagt ttacgaattt qtgttgggcc tcattcaaag ccatectggg ccgcatgcgg 

ttaagaaagt ttacgaattt qtgttgggcc tcattcaaag ccatcctgqg ccgcatgcgg 

cctgtgtgct atgggttaga caagcttaga gtagagcata gtgggtgaga gaagaggctt 
cctgtgtgcL atgggttaga caagcttaga gtagagcata qtgggtgaga gaagaggctt 
cctqtqtgct atgggttaga caagcttaga gtagagcata gtgggtgaga gaagaggctt 

cctgtgtgc:t atgggttaga caagcttaga gtagagcata gtgggtgaga gaagaggctt 

cagatgcttc tggcttgagt Lcaaatcctg actctgccac tcactagctg tgtqaccttt 
cagatgcttc tggcttqaqt tcaaatcctg actctgccac tcactagctg tgtgaccttt 
cagatgottc tggcttgagt tcaaatcctg actctgccac tcactagctg tgtgaccttt 

cagatgcttc tggcttgagt tcaaatcctg actctgccac tcactagctg tgtgaccttt 

ggcaagtgac ttaacctctc tgtqtttcca tttcctcctc tgtaaaatgg aaataataat 
ggcaagtgac ttaacctctc tgtgtttcca tttcctcctc tgtaaaatgg aaataataat 
ggcaagtgac ttaacctctc tgtgtttcca tttr:ntcctc tgtaaaatgg aaataataat 

ggcaagtgac' ttaacctctc tgtgtttcca tttcctcctc tgtaaaatgg aaataataat 

aatgatacct gtctcataaa gtggtcagga ttaaatgagt taatacatat aaagagctta 
aatgatacct gtctcataaa gtggtcagga ttaaatgagt taatacatat aaagagctta 
aatgatacct gtctcataaa gtggtcagga ttaaatgagt taatacatat aaagagctta 

aatgatacct gtctcataaa gtggtcagga ttaaatgagt taatacatat aaagagctta 

gatcagtgcc tgytgcctat aascactcca gtggtaactg ttataatttg tattatctct 
gatcagtgcc tggtgcctat aaacactcca gtggtaactg ttataatttg tattatctct 
gatcagtgcc tggtgcctat aaacactcca gtggtaactg ttataatttg tattatctct 

gatcagtgcc tggtgcctat aaacactcca gtggtaactg ttataatttg tattatctct 

"^tggtgattcc cataggtcac aggcattgaa agtgctcaaa gggccaggtg cagtggctca 
tggtgattcc cataggtcac aggcattgaa agtgctcaaa gggceaggtg cagtggctca 
tggtgattcc cataggtcac aggcattgaa agtgctcaaa gggceaggtg cagtggctca 

tggtgattcc cataggtcac aggcattgaa agtgctcaaa gggceaggtg cagtggctca 

cgcctgtaat cccagcactt Lgggaggtcg aggtgggtgg atcacctgag gtcaggagtt 
cgcctgtaat cccagcactt tgggaggtcg aggtgggtgg atcacctgag gtcaggagtt 
cgcctgtaat cccagcactt tgggaggtcg aggtgggtgg atcacctgag gtcaggagtt 

cgcctgtaat cccagcactL Lgggaggtcg aggtgggtgg atcacctgag gtcaggagtt 

cgagaccagc ctggccaaca tggtgaaacc ccgtctctac Laaaaataca aaattagcca 
cgagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaattagcca 
cgagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaattagcca 

cgagaccagc ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaattagcca 

ggcgtgttgq cqcatgcctg taatcccagc tacttgggaa gctqaggcag gegaattqcc 
ggcgtgttgg cgcotgcctg taatcccagc tacttgggaa gctgaggcag gagaattgcc 
ggcgtgttgg cgcatgcctg taatcccagc tacttgggaa gctgaggcag gagaattgcc 


ggcgtgttgg cgcatgcctg taatcccagc 


cttgggaa gctgaggcag gagaattgcc 


tgaacccggg aggcggaggt tgcagtgagc caagatcaca ccattgcact ccagcttagg 
tgaacccggq aggcggaggt tgcagtgagc caagatcaca ccattgcact ccagcttagg 
tgaacccggg aggcggaggt tgcagtgagc caagatcaca ccattgcact ccagcttagg 


tgaacccggg aggcgg^igyi 


tgcagtgagc caagatcaca ccattgcact ccagcttagg 

caacaagagc gaaacatcgt ctcaaaaaaa aaaaaagaga aaatgctcaa atgtaataaa 
caacaagagc gaaacatcgt ctcaaaaaaa aaaaaagaga aaatgctcaa atgtaataaa 
caacaagagc gaaacatcgt ctcaaaaaaa aaaaaagago aaatgctcaa atgtaataaa 
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r:aacaagagc g<3riac5tcgt Rtcaaaaaaa aaafiaa^aga aaatQCtcaa atgtaataaa 

gacagtattc atcgagcgtt ttctctgtgc catcaqtgtg ctgagtgctt tacctgcacc 
gacagtattc atcyagcytt ttctcLcjCtjc catcagbgt*? ctgagtoctt tacctgcacc 
gacagtattc atcgagcgtc ttctctgtgc catcaqtgtg ccqagtqctt tacctgcacc 

gacagtattc atcgagcgtt ttctctgtgc catcagtgtg ctgagtgctt tacctgcacc 

ate teat tta tLttit:eqcaa accctucLgt ggtttaggtg ctaccatccc cattLggcag 
atctcattta tttcccgcaa accctcctgt qqtttaggtg ctaccatccc catttggcag 
atctcattta tttcricgcaa accct.Rctgt ggtttaggtg ctaccatncc catttggcag 

atctcattta tttcccgcaa aeccteetgt ggtttaggtg ctaccatccc catttggcag 

aagtggaaat ggagtcccct agaaggtacg tagcttgtag aaagtcatgt aattagtcaa 
aagtggaaat ggagtcccct agaaggtacg tagcttgtag aaagtcatgt aattagtcaa 
aagtgqaaat ggaqtcccct agaaqqtac? tagcttcftag aaaqtcacqc aattagtcaa 

aagtgigaaat gga^cccct. a^aaggtacv Lagcttgtag aaagtcatgt aattagtcaa 

catcgtactt ceagccactg gccagcagaa gcttccccat caattcaaat ccattgaatc 
catcgtactt ccagccactg gccagcagaa gcttccccat caattcaaat ccattgaatc 
catcgtactt ccagccactg gccagcagaa gcttccccat caattcaaat ccattgaatc 

catcgtactt ccagccactg gccagcagaa gcttccccat caattcaaat ccattgaatc 

ctaatggcaa tttaaqaggc ctgttttatt ttataggtgg qtqtttqtgt qggaaaaggg 
ctaatggcaa tttaaqaggc ctgttttatt. ttataggtgg gtgtttgtgt gggaaaaqorg 
ctaatggcaa tttaaqaggc ctgttttatt ttataggtgg gtgtttgtgt gggaaaaggg 

ctaatggcaa tttaaqaggc ctgttttatt ttataggtqg gtgtttgtgt gggnaaaggg 

ctaccagqaa actgactccq tggtcagctc cgttacqacc aaggtcaagg qcgtgqctgt 
ctaccaggaa acLgactccg tgytcagctc cgtLacgacc aaggLcaagg gcgtggctgt 
ctaccaggaa actgactccg tggtcagctc cgttacgacc aaggtcaagg gcgtgqctgt 

ctaccaggaa actgactccq tggtcagctc cgttacgacc aaggtcaagg qcgtgqctgt 

gaccaacact tctaaacttq gattccggat ctgggatgtg gcggattatg tga taccaqc 
gaccaacact tctaaacttg gattccggat ctgggatgtq gcggattatg tgataccagc 

qRt^cAsaact. tctaaacttg gattccggat ctgggatgtg gcggattatg tgataccagc 

gaccaacact tctaaacttg gattccggat ctgggatgtg gcggattatg tgataccagc 

tcaggtqtgt ctcccactgt gtcttctgtc taacactgac accttgctct ttactgactt 
tcaggtgtgt ctcccactyL gtcttctgtc taacactgac accttgctct ttactgactt 
tcaggtgtgt ctcccactgt gtcttctgtc taacactgac accttgctct ttactgactt 

tcaggtgtgt ctcccactgt gtcttctgtc taacactgac accttgctct ttactgactt 


tgcacacttg attaatqatt gacgtgqcct gagtcctgac atcaccaatg 
tqcacacttg attaatqatt qacgtqqcct gagtcctgac atcaccaatg 
tgcacacttg attaatqatt gacgtgqcct qaqtcctqac atcaccaatg 


cccagaatga 
cccagaatqa 
cccagaatga 


tgcacacttg attaatgatt gacgtgqcct gagtcctgac atcaccaatg cccagaatga 

ttgttttaqt ccaagaaaac gqtgcaqcca qatctctttt cagcttagga tctaacaqag 

ttgttttagt ccaagaaaac ggtgcagcca gatctctttt cagcttagga tctaacaqag 

ttgttttagt ccaagaaaac gqtgcaqcca gatctctttt cagcttagga tctaacagag 

ttgttttagt ccaagaaaac ggtgcagcca gatctctttt cagcttagga tctaacagag 

tgtatccagg cagctqtggt tcctcagctt gaaagcatgg caggctgaaa gccctgaaag 

Lgtatccagg cagctgtggt tcctcagctt gdddgcatgg caggctgaaa gccctgaaag 

tgtatccagg cagctqtggt tcctcagctt gaaagcatgg caggctgaaa gccctgaaag 

tgtatccagg cagctqCqqt tccCcaqctt gaaagcatgg caggctgaaa qccctqaaag 


atqagaaaac aaatgtctti 
atgagaaaac aaatgtcttc 
atgagaaaac aaatgtctt; 


siiLcctgata gaaaLtcaag tgcjdaatatc - 

aotcctgata gaaattcaa(5 tqqaaatatc . 
aatcctgata gaaattcaag tgg.^satatc . 


atgagaaaac < 


atgtctta aatcctqata gaaatt 


ag tggaaatatc acctttccac 


atttCgggtt tgcattattg tcccaaatac acgaggtaao tcttctcaga 
attttgggtt tgcattaLLg tcccaaatac acgaggtaaa tcttttcaga 
atttcgqgtt tqcaccattg tcccaaatac acqaqgtaaa tcttttcaga 


cagtgtaaaa 
cagtgtaaaa 
cagtgtaaaa 


attttgggtt tgcattattg tcccaaatac acgaggtaaa tcttttcaga cagtgtaaaa 


actttaggcc gygcacagta gctcacgtgt gtastcccag cactttggga 
actttaggcc gggcacagta gctcacgtgt gtaatcccag cactttggga 
actttaggcc gggcacagta gctcacgtgt gtaatcccag cactttggga 


ggctqcjggcG 
ggctqggqcg 
ggctggggcg 
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actttagqcc ggqcacaqta gctcacgtqt gtaatcccag cactttggga ggctggggcg 

qgcagatacc ttgagcccag gagtttgaga ccagcctggg caatgtggtg aaaccccatc 

ggcagatarc ttgagcccag gagtttqaga cnagcctygg caatgtggtg aaaccccatc 

ggcagatacc ttgagcccag gagtttgaga ccagcctggg caatgtggtg aaaccccatc 


ggcaqatacc ttgagcccag gagtttgaga ccagcctggg caatgtggtg aaaccccatc 


tcLataaaaa 
tctataaasa 
tctataaaaa 


aaaaaaaaad 
aaaaaaaaaa 
aaaaaaaaaa 


gaaaaaaaaa 
gaaaa^aa aa 


caaadgaatt 
qaaaagaatt 
gaaaagaatt 


agagaccatt 
agagaccatt 
agagaccatt 


aqagaccaaa 
aqagaccaaa 
agagaccaaa 


tr:tat.aaaaa aaaaaaaaaa gaaaaaaaaa gaaaagaatt agagaccatt agagaccaaa 

accc^aacaa gaaggaaaca gaigacaqcag cacaqgactc catcgggagc gaaaggaagg 
acccaaacaa gaaggaaaca gagacagcag cacaggactc catcgggagc gaaaggaagg 
acccaaacna gaaggaaaca gagacagcag cacaggactc catcgggaigc gaaaggaagg 


acccaaacaa 'gaaggaaaca gagacagcag cacaggactc catcgggagc gaaaqqaagg 

qaagacattg tttactgctg catactagtc ttcaggctqt taatctgcaa agctccaagt 
gaagacattg tttactgctg catactagtc tt.caggctgt taatctgcaa agctccaagt 
gaagacattg tttactgctg catactagtc ttcaggctgt taatctgcaa agctccaagt 


gaagacattg tttactgctg catactagtc ttcaggctgt taatctgcaa agctccaagt 

taaacattga ctcfcttngLc ctggtaacafj ctcLgcacca cacagtcaaa cccgacatag 

taaacattga ctctttagtc ctggtaacag ctctgcacca cacagtcsaa cccgacatag 

taaacactga ctctttagtc ctggtaacag ctctgcacca cacagtcaaa cccgacatag 


taaacattga ctctttagtc ctggtaacag ctctgcacca nacagtcaaa cccgacatag 

ccctccagat ggcctatgtg ctaagggacc gagggcgacc agtagagata agctgccaac 
ccctccaqac ggcctatgtg ctaagggacc gagggcgacc agtagagata agctgccaac 
ccctccagat ggcctatgtg ctaagggacc gagggcgacc agtagagata agctgccaac 


ccctccagaL ggcctatgtg ctaaijggacc gagggcgacc agtagagata agctgccaac 

tcttgqctca gagctcttgc cagggaaaac aagaactcca gaaactcttt cttcaggtgt 
tcttggctca gagctcttgc cagggaaaac aagaactcca gaaactcttt cttcaggtgt 
tcttggctca gagctcttgc cagggaaaac aagaactcca gaaactcttt cttcaggtgt 


tcttggctca gagctcttgc cagggaaaac aagaactcca gaaactcttt cttcaggtgt 

gttcttcccc ctgccttctc aagagaaggg tcttgtgygg tccccayaga ggayaccgag 

gttcttcccc ctgccttctc aagagaaggg tcttgcgggg tccccagaga ggagaccgag 

gttcttcccc ctgccttctc aagagaaggg tcttgtgggg tccccagaga ggagaccgag 


gttcttcccc ctgccttctc aagagaaggg tcttgtgggg tccccagaga ggagnccgag 

ctaatgagcc ataaaagtca ggaggaaggg ccaggtgcgg tggctcatgc ctgtaatccc 
ctaatqagcc ataaaagtca qqaggaagqg ccaggtgcgg tggctcatgc ctgtaatccc 
ctaatgagcc ataaaagtca ggaggaaggg ccaggtgcgg tggctcatgc ctgtaatccc 


ctaatqagcc ataaaagtca ggaggaaggg ccaggtgcgg tggctcatgc ctgtaatccc 

agcactttgg gaggccgagg tcggtgtatc atgaggtcaa gagattgaga ccatcctggc 

agcactttgg gaggccgagg tcggtgtatc atgaggtcaa gagattgaga ccatcctggc 

agcactttgg gaggccgagg tcggtgtatc atgaggtcaa gagattgaga ccatcctggc 


agcactttgg gaggccgagg tcggtgtatc atgaggtcaa gagattgaga ccatcctggc 

taacatggtg aaaccctgtc tactaaaaat acaaaaatta gtggggtgtg gtggtgggtg 
taacatggtg aaaccctgtc tactaaaaat acaaaaatta gtggggtgtg gtggtgggtg 
taacatggtg aaaccctgtc tactaaaaat acaaaaatta gtggggtgtg gtggtgggtg 


taacatggtg aaaccctgtc tactaaaaat acaaaaatta gtggggtgtg gtggtgggtg 

ccLgtaqtcc cagctacLaq ggaggctgag gcaggagaat ctcttgadcc tgggaggcqg 
cctgtagtcc cagctactag ggaggctgag gcaggaqaat ctcttgaacc tgggaggcgg 
cctgtagtcc cagctactag ggaggctgag gcagqaqaat ctcttgaacc tgggaggcgg 


cctgtiigLcc cayctactag ggaggctyjg gcaggagaat ctcttgaacc tgggaggcgg 

aqgttgtaqt qagccgagat aqtqccactg cactccagcc tqgtgacaga gcaagactcc 

aggttgtagt qagccgagat agtgccactg cactccagcc tggtgacaga gcaagactcc 

aggttqragt gaqccgagat agtgccactg cactccagcc tggtgacaga gcaagactcc 


agqttqcagc qagccgagat agtgccactg cactccagcc tggtgacaga gcaagactcc 

gtctcasaaa aaaagatcaa gtggaacaqg ggatttgcag ttggctiggag ggggcagtcc 

qtcLcaaaaa aaaagatcaa gtggaacaqg ggattrgcag ttggctggag ggggcagtcc 

gtctoaaaaa aaaagatcaa gtggaacagg ggatttgcag ttggctggag ggggcagtcc 
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aaa aaaagatcaa gtggaacaqg ggacttgcag tcggctggsg gggycciiJtCi; 


Cqttarjcntg 
tqttagcatq 
tgt tagcatg 


tacctgtctc 
ta<Jctgtctc 


agcrciqagaga 
Figf agagnqn 
aqcaiqaqa,ga 


(jggttgcctc 
cqgttqcctc 
cggttgcctc 


ctgtgcaggc 
ctgE gcaggc 
ctqtqcaqgc 


ataguctLy t 
cltngcnttqt 


tgttagcatg tacctgtctc agcagagaqa cqgttgcctc ctgtgcaqqc ataqccttgt 


cccqqtgagc 
cccggcgagc 
cccggtgagc 


tctgggaiatq 
tctgggaatg 
tctgqqaatg 


tcatgctcgg 
tcatgctcgg 
rcatgctcgg 


tgtgtttact 
tgtgtLLact 
tgtgtrr.act 


gggcagatga 
gggcagatga 
gggcragat.ga 


ggaagggcac 
ggaagggcac 
ggaaqggcas 


cccggtgagc tctgggaatg tcatgctcgg tgtgtctact gggcagatga ggaagggcac 


aagccccgct 

aagccccqct 
aagccccgct 


ttCgtccctt 

tttqtccctt 
tttgtccctt 


ggtggaaagg 

gqtqqaaagg 
ggtggaaagg 


agcaccccac 

aqcaccccac 
aycaccccac; 


aactcagcct 

aactcagcct 
aactcag<jct 


tccgtiggtgg 
tccgtggtqg 
tccgcygtgg 


agccocgct tttgtccctt ggtqgaaagq aqcaccccac aactcaqcct tccqtqqtgq 


atctgggcgq 
atctgggcgg 
atctqggcgg 


gqgccgtqtg 
gggccgtgtg 
qggccgtgtg 


aqttccccag 
agttcct:cag 
agtiCccccag 


qgctttcctc 
qgctLtcctc 
ggctttcctc 


atgagctttc 
atgagctttt: 
atgagcttrc 


acgacgtctg 
acgacgtLiLcj 
acgacgrcr.g 


atctgggcgg gggccgtgtg agttcccnag ggcttf-.cctc aCgagctCCc acgacgtctg 


tgttctttgt 
tgttctttqt 
tgttctttgt 


agatggaaca 
agatggaaca 
agatggaaca 


cagtcatcta 
cagtcatcta 
cagtcatcLa 


tttguagccc 
tctgnagcee 
tttgcagccc 


ct Lggctcat 
crtggctcat 
cttggctcat 


ctccagautg 
ctccagactg 
ctccagactg 


tgttctttgt agatggaaca 

aaattaccct taatttgaag 
aaattaccct taatttgaag 
aaattaccct taatttgaag 


tttqcagccc cttggctcat ctccagactg 


taqatsagtg 
tagataagLg 
taqataagtg 


ataqgaaaca 
ataggaaaca 
ataggaaaca 


ggccsgacgt 
qgccaqatgt 
qgccagatgt 


ggtggttcat 
ggtqgttcat 
ggtggttcat 


aaattaccct taatttgaag tagataagtg ataggaaaca ggccagatgt ggtggttcat 


gcctgtaatc 
gcctgtaatc 
gcctgtaatc 


tcaacacttt 
tcaacacttt 
tcaacacttt 


Lggaggccga 
t.ggaggccga 
tggaggccga 


ggcagaagga 
ggcagaagga 
ggcagaagga 


tcacttgagc 
l:car:t:t:gagc 
tcacttgsqc 


ccaggagttt 
ccaggagttt 
ccaggagttt 


gcctgtaatc tcaacacfctt tggaggccga ggcagaagga tcacttgagc ccaggagttt 


gaggctgaag 
gaggctgaag 
gaqgctgaag 


tgaactgtga 
tgaactgtga 
tgaactgtga 


tcacgceaet 

tcacgccact 
tcacgceaet 


gcactecaqc 

gcactccaqc 
gnactccagc 


tagggcaaca 
tagggcaaca 
tagggcaaoa 


gagnaagaco 
gaqcaagacc 
gagca-igacc 


gaggctgaag tgaactgtga tcacgceaet gcactccaqc tagggcaaca gagcaagacc 


ccaactctaa 
ccaactctaa 
ccaactctaa 


aagagagaga 
aagagagaga 
aagagagaga 


aagaaggaga 
aagaaggaga 
aagaaggaga 


gataggaaac 
gataggaaac 
gataggaaac 


yaaaagtaga 
gaaaagtiiga 
gaaaagtaga 


Gcatactgca 
ccatactgga 
ccatactgga 


ccaactctaa aagagagaga aagaaggaga gataggaaac gaaaagtaga ccatantgga 


ttqttccaag 
ttqttccaag 
ttgttncaag 


tataactcag 
tataactcag 
tataactcag 


gagagtogta 
gagagtcgta 
gagagtcgta 


ggagcggcct 
gqagcggcct 
gqagcggcct 


tctetaaeafi 
tctctaacac 
tctctaacac 


caaatgcaga 
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caaatgcaga 


ttgttccaag tataactcag gagagtcgta gqagcggcct 


:Lctaacac caaatgcaqa 


gtgtytttgt 
gtgtgtttgt 
qtgtgtttqt 


acat tgaacci 
acattgaaca 
acattgaaca 


cctgggaacq 
cctqggaacg 
cctgqgaacg 


tqacaggtgg 
tqacaggtgg 
tgacaggtgg 


cgiggagaqag 
cgggagagag 
cgggagagag 


qctqcaqctc 
qctgcagctc 
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atgccacatc 
atgccacatc 
atgccacatc 


ctoacgcctc 
ctcacgcctc 
ctcacgcctc 


ntoctgtggo 
ctcctgtggc 
ctcctgtggc 


cttctccttg 
Cttctccttq 
cttctccttg 


tctcccaaca 
tctcccaaca 
tctcccaaca 


tagttcgccc 
tagtttgccc 
tagtttgc-cc 


atgccacatc ctcacgcctc utcctgtggu cttctccttg tctcccaaca tagtttgccc 


taggcctttg 
caggcettcg 
taggcctttg 


gaggacQcca 
gaggacgcca 
gaggacgcca 


gctatttcta 
gctat ttcta 
gctatttcta 


gagcttccac 
gagct tccac 
qaqcttccac 


cacttgtcag 
cacttgtcag 
cacttgtcag 


accqtggqqt 
accgtggggt 
accgtqgggt 


taggcctttq gaggacgcca gctatttcta gagcttccac cacttQtcag accqtggggt 


tggggatgtc 
tggggatgtc 
tggggatgtc 


tacagactgq 
tacagactgg 
tacagactgg 


tcggtttctq 
ttggtttctg 
ttggtttctg 


ogttgccctg 
agttgccctg 
agttgccctg 


atccccQoga 
atccccaaga 
atccccaaga 


tqgctcccac 
tggctcccac 
tqgctcccac 
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tggggatgtc tacaqactqq ttqqtttctq aqttgccctg atccccaaga t^gctcccac 

tactqactgt tatcctcagg agcgtgaaat gctaasagct ttgqqt:aLci. ccttttaaaa 
tactqactqt tatcctcacjg aqLgttjariat gct.aaaagct ttgggcatct ccttttaaaa 
tactgflnt:gt t.atcctcsgg agtgtgaaat gctaaaagct ttgggcatct ccttttaaaa 

tactqactgt tatcctcagg agtgtgaaat gctaaaagct ttgggcatct ccttttaaaa 

tgcctgggcc cagccaggcg cagtggc^tta trrjuetgtagL uccagcantL ttggaggcca 
tgcctgggcc cagtMiaggcg cagtggctta cgcctgtagt cccaqcactt ttggaggcca 
tgcctgggcc cagccaggcg cagtggctta cgcctgtagt cccaqcactt ttgqaggcca 

tgcctgggcc cagccaggcg cagtggctta cgcctgtagt cccaqcactt ttggaggcca 

aggcaggagg atcacctgag cccagg^at ( c aa gacfz^^gcc: tggycaatat sytgaqaccc 
aggcaggagy atcactt.gag ctcaggattc aagaccagcc tgggcaatat agtgagaccc 
aggcaggagg atcacttgag ctcaggattc aagaccagcc tgggcaatat agtqaqaccc 

aqqcagqagq atcacttgag ctcaggattc aagaccagcc tgggcaatat agtgagaccc 

catcttgatt ggatgggtgg tggatygatg gatggatgga tggatggatg gatggatgga 
catcttyatt ggatgggtqg tggatggatg gatggatgga tggatggatg gatggatgga 
catcttgatt ggatgggtqg tggatggatg gatggatgga tggatggatg gatggatgga 

catcttgatt ggatgggtgg tggatggatg gatggatgga tqgatgqatg gatggatgga 

taqatagata gaiagataga tagatagata gatagataga tggaaagaaa gaaagaatag 
tagatagata gatagataga tagatagata gatagataga tggaaagaaa qaaagaataq 
tagatagata gatagataga tagatagata gatagataga tggaaagaaa gaaagaacag 

tagatagata gatagataga tagatagata gatagataga tggaaagaaa gaaagaatag 

ataaatgata ggataggaga aagaaaqgqa aagaaagaaa agagaagaqa aaagaaaqaa 
ataaatgata ggataggaga aagaaaggga aagaaagaaa agagaagaqa aaagaaaqaa 
ataaatgata qqatagqaga aagaaaggga aagaaagaaa agagaagaqa aaagaaagaa 

ataaatgata ggataggaga aagaaaggga aagaaagaaa agagaagaga aaagaaagaa 

aagggatagg atagatgaca gatgatagga tagattagat aggtgataga taatcgataa 
aagggatagg atagatgaca gatgatagga tagattagat aggtgataga taatcgataa 
aagggatagg atagatgaca gatgatagga tagattagat aggtgataga taatcgataa 

aagggatagg atagatgaca gatgatagga tagattagat aggtgataga taatcgataa 

atagaasaca gatgatagat atagqatega ttaqatagat agatagatag atagatagac 
atagaaaaca gatgataqat ataggataga ttaqatagat agatagatag atagatagat 
atagaaaaca gatgatagat ataggataga ttagatagat agatagatag atagatagat 

atagFiaaaca gatgatagat ataggataga ttagatagat agatagatag atagatagat 

aqatagacag ataatagata qattagatta gatggatgga caccaqcccc caatgagagg 
agatagacag ataatagata qattagatta gatggatgga caccagccf.:c caatgagagg 
aqatagacag ataatagata gattagatta gatggatgga caccagcccc caatgagagg 

agatagacag ataatagata qattagatta gatggatgga caccaqcccc caatgagagg 

aqgatccaac tagaatcttt tggggtettt gccttttatt ccttatatga ggaggtcatt 
aggatccaac tagaatcttt tggggtettt gecttttatt ccttatatga ggaggtcatt 
aggatccaac tagaatcttt tggggtettt gccttttatt ccttatatga ggaggtcatt 

aggatccaac tagaatcttt tggggtettt gcscttttatt ccttatatga ggaggtcatt 

atcagcaatt cttcagtctc atccctagca aactgctttc cagaaactct. aaatcaatta 
atcagcaatt cttcagtctc atccctagca aattgctttc cagaaactct aaatcaatta 

atcagcaatt cttcaqtctc atccctagca aattgctttc cagaaactct aaatcaatta 

atcagcaatt cttcagtctc atccctagca aattgctttc cagaaactct aaatcaatta 

tttggcacct ttatacatat aaqqaggaaa cagaacctitt tcaaagagaa atgtcacttg 
tttgqcacct ttiitacatat aaggaqgaaa caqaaccttt tcaaagagaa atgtcacttg 
tttggcacct ttatacatat aaggaggaaa cagaaccttt tcaaagagaa atqtcacttq 

tttggcacct ttatacatat aaggaggaaa cagaaccttt ccaaagagaa atgtcacttg 

ctgaccctga ccgagtcttc tctaqaatgt ttyatttata atttcagtte ggaaagaaaa 
ctgaccctga ccgagtcttc tcM.aqaatgt ttgatttata atttcagtta ggaaagaaaa 
ctgaccctga ccgagtcttc tctagaatgt ttgatttata atttcagtta ggaaagaaaa 

ctgaccctga ccgagtcttc tctagaatgt ttgatttata atttcagtta ggaaagaaaa 

ccagccagat ttgagcccca tgaggcctca tgtttttatt cctctgagat aatctggttg 
ccagccagat ttgagcccciJ tgaggcctca tgtttttatt cctctqagat aatctggttg 
ccagccagat ttgagcccca tgaqgcctca tgtttttatt cctctgagat aatctggttg 


FIGURE 20 


Patent Application Publication Jan. 29, 2004 Sheet 28 of 51 US 2004/0018533 Al 


>Pcirtl of 1 
>NCBl b29 
>NCBI Old 


' 8130B1 
#13081 


>Partl of Rns.#103Hl 
>NCHI b20 #13141 
>NCBI old ttl314l 


>Partl of Ens. #10441 
>t1CBl b29 #13201 
>NCBI Old #13201 


>Partl of Fns.#10S01 
>NCBI b25 #13261 
>NCBI Old #13261 


>Partl of Eng. #10561 
>NCB1 b29 #13321 
>NCBI old #13321 


>Partl of Sns,ttl0€2l 
>NCBI b29 #13381 
>NCBI old tH3391 


>Partl of Ens. #10661 
>HCBI b29 »13441 
>NCBI old #13441 


>FartLl of Ens. #10741 
>HCBI b29 #13501 
>11CBI old #13501 


>Pdrtl of Ens.ttlOBOl 
>KCBI bZ9 #13561 
>NCBI aid #13561 


>Partl of Ens. #10061 
>NCBI b29 #13621 
>NCBI Old #13621 


>Partl of I 
>NCBI 1>29 
>NCBI Old 


.#109:^1 
«i36ai 

#13681 


>Partl of Ens. #10981 
>NCB1 1529 #13741 
>MCBI old #13741 


>Pa-ctl ot tns. #11041 
>NCBI b29 #13801 
>NCBr old #13801 


>PaEtl of Ens. #11101 
>NCBI b29 #13861 
>MCBI Old #13861 


ccaqccaydt ttgaycccca tgaggcctci 


gcagggaatg 

gcagggaatg 


ttfr.tfagcac 
tttttagcac 
tttttagcac 


ttgttgqctc 
ttgttggctc 
ttqtitggctc 


tqtttttatt cctctgagat . 

tttccctcag taagtcaagt 

tttcccncag taagtcaagt 

tttccctcag taagtcaagt 


ttgattgaga 
ttqattgaqa 
Ltqattgaga 


gcagggaaLg CLLttagcac ttgttggctc tttccctcag taagtcaagt rtgattgaga 


aaccattgtt 
aaccattgtt 
aaccattgtt 


tcagtttgca 
tcagtttgca 
tcagtttgca 


gacccccccc 
gacccccccc 
gacccccccc 


ttttttcaac tgcagtcaaa 
tttcttcaac tgcagtcaaa 
tttttttaac tgcagtcaaa 


ttcacttata 
ttcacttata 
ttcacttata 


aaccattgtt tcagtttgca gacccccccc tttttttaac tgcagtcaaa ttcacttata 


gaaaotaacc 
gaaactaacc 
gaaacraacc 


acttcaagcc 
attttaagtc 
attttaaglc 


gqqcacaqtg 
gggcacagtg 
qgqcauagtg 


gctcatgctt gtiaaccccag 
gctcatgctt gtaatcccag 
gctcatgctt g^aatcccag 


cactttggga 
cactttggga 
cactttggga 


qaaactaacc attttaagtc gqgcacagtg gctcatgctt gtaatcccag cactttggga 


ggctgaggca 
ggctgaggca 
ggctgaggca 


ggtagatcac 
ggtagatcac 
ggtagatcac 


ttgaggccag 
ttgaggccag 
Ltgaggcoag 


gagttggaga ccagcctagc 
ga^tbggaga ccagcctaqc 
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caacatggtg 
caacatqgtij 
caacatggtg 


ggctgaggca ggtagatcac ttgaggccag gagttggaga ccagcctagc caacatggtg 


aaotcetatc 
aaatcctatc 
aaotcetatc 


tctactaaag 
tctactaaag 

tctactaaag 


atacaaaaat 
iitacaaaaa<- 
atacaaaaat 


tagccaggtq tgqtgatgta 

t-aqccaqgtg tgqtgatgta 
tagccaggtq tgqtgatgta 


cacctqtagt 
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cccagctact 
cccagctact 
cccagctact 


caggagggtg 
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aggctgqaga 
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atcqcttgaa cccagqaggc 
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gqaqattqca 
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lagctact caggagggtg aggctggaga atcgcttgaa cccaggaggc ggagattgca 


gtaagcca.aq 
gtaagccaag 
gtaagccaag 


atcgtgccac 
atcgtgccac 
atcqrgccac 


tyccctgcag 
tgccctgcag 
tgncctgcag 


cctgggcgac agagcaaggc 
cctgggcgac agagcaaggc 
cctgggcgac agagcaaggc 


tctgtctcaa 
tctgtctcaa 
tctgtctcaa 


gtaaqccaag atcgtgccac tgccctqcag cctgggcgac agagcaaggc tctgtctcaa 


ataaataaat 
ataaataaat 
ataaataaat 


aaataagaaa 
aaataagaaa 
aaataagaaa 


gaaattaacc 
gaaattaacc 
gaaattaacc 


attttaaaat gaaaaattca 
attLttaaaat gaaaaattca 
attttaaaat gaaaaattca 


gcBcatttag 
gcacatttag 
gcacatttsg 


ataaataaat aaataagaaa gaaattaacc attttaaaat gdaaaaLtua qcctca L L tag 


tgcattcaca 

tgcattcaca 


gtgttat^ca 
qtgttatgca 
gtgttatgca 


agcatcacct 
agcatcacnt 
agcatcacct 


ctatctaqtt caaaacattt 
ctatctagtt caaaacattt 
ctatctagtt caaaacattt 


ccatcacccc 
ccatcacccc 
ccatcacccc 


tgcattcaca qtgttatgca agcatcacct ctatctagtt 


aaacattt ccatcacccc 


aaagggaagc 
aaagggaagc 
aaagggaagc 


ctcgtatgtt 
ctcgtatgtt 
ctcgtatgtt 


tccccattcc ccacttctct 
tccccattcc ccacttctct 
tccccattcc ccacttctct 


cactccctgg 
cactccctgg 
cactccctgg 


aaagggaagc ctcgtatgtt acacaacca< 


tccccattcc ccacttctct cactccctgg 


coacccgtaa 
caacccgtaa 
caacccgtaa 


gtgcttCgCc 
gtgctttgtc 
gtgctttgtc 


tctatggact 
tctatggact 
tctatqgact 


tgactgttct ggaaatttca 
tgactgttct ggaaatttca 
tgactgttct ggaaatttca 


catcaatgta 
catcaatgta 
catcaatgta 


caacccgtaa gtgctttgtc tctatggact tgactgttct ggaaatttca catcaatgta 


atggtatcct 
atggtatcct 
atggtatcct 


gtgtggcctt 
gtgtggcctt 
qtqtggcctt 


tgtgtctqgc 
tgtqtctggc 
tgtgtctqgc 


ttctttcact cagcatcatg 
ttctttcact cagcatcatg 
ttctttcact cagcatcatg 


tttttggggt 
tttttggqgt 
tttttggggt 


atggtatcct gtgtggcctt tgtgtctqgc ttctttcact cagcatcatg tttttggggt 


tcatccatgt 
tcatccatgt 
tcatccatgt 


tgtagcctqt 
tgtagcctgt 
tq tagcctgt 


atcagtgctt 
atcagtgctt 
atcagtgctt 


tattccttta ttttattttt 
tattccttLd ttttatLLtt 
tattccttta ttttattttt 


tqaqatqaaa 
tgagatgaaa 


teatccatgt tgtagcctgt atcagtgctt tattccttta ttttattttt Cgogatgaaa 


tctcactctg 
tctcactctg 
tctcactctg 


tcacccaggc 
tcacccaggc 
tcacccaqgc 


tggagtgcac 
tggagtgcac 
tggagtgcac 


Cggcgtgatc tcggctcact 
tggcgtgatc tcggctcact 
tggcgtgatc tcggctcact 


gcaacctcca 
gcaacctcca 
Qcaacctcca 
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tctcactctg tcacccaggc tggagtgcac tggcqtgatc tcggctcact gvetacctcna 


>Partl of Erks. #11161 
>KCBI b29 ftl3921 
>KCBI old #13921 


>eartl ur Eiis.»11221 
»ICBX b29 #13981 
>NCBI old ft 13 981 


>Partl of Ens. » 11281 
>NCBZ fa29 fll4041 
>NCBI CPld H 14 041 


>Partl of Eiis. #11341 
>taCBI b29 #14101 
>NCBZ old #14101 


>PaTtl of Ens. #11401 
>HCBI b29 #14161 
>NCBI old #14161 


>Partl of Ens. #11461 
>NCBI b29 #14221 
>NCB1 old #14221 


>Fartl of Ens > #11521 
>NCBI b29 #14281 


>MCBI old 


>Partl o£ Ens. #11581 
>NCBI b29 #14341 
>HCBI old #14341 


>Partl of Ens. #11641 
>NCHI b29 #14 401 

>KCai old #14 401 


>Partl of Erva. #11701 
>HC:bi b29 #144£1 
>]!ICBI old #14 461 


>PartT of Ens. #11761 
>NCBI b29 #14 521 

>MCBI Old #14521 


>PacCl of Ens. #11821 


>NCBX b29 
>tlCBI Old 


»145S1 
tfl4b81 


>Pactl of Ens.BllBai 
>I4CBI b2d #14641 
>HCBI old 014&I1 


i-Pactl of Ens. 1111941 
>NCB1 b2d #14701 
>NCB1 old #14701 


cctcccaqqt 
r.ntccc-aggt 
Lrc-tcccaggt 


tccagtgatt 
tcci^gtgsirt 
uccsgtgst-t 


ctcctggctc 
ctcctqgct.c 
ctcct-ggctc 


agccccctga gtagctgggs ttacagqtqc 
aqcr.ccctqa gtagctggga ttacaQgtgC 
agccrccctga ijtagctgggei ttacaggtgc 


eetcccaggt tccagtgatt ctcctggctc agccccccga gtagctggga ttacaggtgc 


ccgcciicjcac 
cogccaccao 
ccgccaccac 


accuggctcc 
acccggcccc 
acccgqctcc 


ttttgtgtgt 
ttttgtgcgt 
ttttgtqtgt 


gcgtgtgtct gtgtgtgttt gtgtgtgtat 
gogcgtgtct gtgtgtgttt gtgtqtgtgt 
gcqcqtgtct gtgtgtgttt qtgtgtgterC 


ccqtrcarrcac accrggctcc tt.ttgtgtgt gcqtgtgtnt. gtgtgtgttt gtgtgtgtgt 


gtgtqtgtgt 

gtgtgtgtgt 
gtgtgtgtgt 


qtqtgtgcgt 
gtgtgtgtgt 
gtgtgtgtgt 


ttttagtaM 
ttttagtaca 
ttttagtaca 


aagtttcacc atgttggcca gqctaatctg 
aagtttcocc atgttggcca gqctaatctg 
aaqtttcacc atgttggcca ggctaatctg 


gtgtgtgtgt gtgtgtgtgt ttttagtaca aagtttcacc atgttggcca ggctaatctg 


gaatkcfctga 
gaatLcttga 
gaattcttga 


cctcaggtga 
t:c:tcaggtga 
cct.caggtga 


tcctcfitgcc 

tcctccrtigcc 
tcctcctgcc 


tcagcctccc aaagtgctgg aattacaggt 

tcagcctccc aaagtgctgg aattacaggt 
Ccagcctccc aaagtgctgg aattacaggt 


gaattcttga cctcaggtga tcctcctgcc tcagcctccc aaagtgctgg aattacaggt 


gtgagccBcc 
gtqagccacc 
gtgagccacc 


aLgcctggac 
atgcctggac 
atgcctggac 


Oct teat tec 
ccttcattcc 
ccttcattcc 


tttttattca tggctgaatc acattccatc 
tttttattca tggctgaatc acattccatc 
tttttattca tggctgaatc acattccatc 


gtqagccacc atgectggar: ccttcattcc tttttattca tggctgaatc acattccatc 


atgtgactaq 
atgtgactsg 
atgtgnctag 


actaggacag 
actagga cag 
actaggacag 


acatttgggc 
acatttgggc 
acatttgggc 


tgtttccacc attttttctt tttttctgag 

tgtttccacc attttttttt tttttttgag 
tgtttccacc attttttttt tttttttgag 


atgtgactag actaggacag acatttgggc tgtttccacc attttttttt tttttttgag 


acagagtctc 
acagagtctc 
acagagtctc 


gtgtcatcca 
gtgtcatcca 
gtgtcatcca 


ggctggagtg 
ggctggagtg 
ggctggagtg 


cagtggogcg atgtcggctc gctgcaacct 
cagtggcgcg atgtcggctc gctgcaacct 
cagtggcgcg atgtcggctc gctgcaacct 


acagagtctc gtgtcatcca ggctggagtg cagtggcgcg atgtcggctc gctgcaacct 


cegactcctxi 
ccgactcccg 
ccgactcccg 


ggttcaagca 
ggttcaagca 
ggttcaagca 


attetcctgc 
attctcctgc 
attetcctgc 


ctcagcctcc caagtagctg ggactacagg 
ctcagcctcc caagtagcbg ggactacagg 
ctcagcctcc caagtagctg ggactacagg 


ccgactcccg ggttcaagca attutcctgc uLcagcctcc caaqtagotg ggacLacagg 


tgcccgccat 
tgcccgccat 
tgcccgccat 


cacacccggc 
cacacccggc 
cacacccggc 


taatttttgt 
taatttttgt 
taatttttgt 


atttttagta gagacggggg tttcractgtg 
atttttaqta gagacggggg tttcactqtg 
atttttagta gagacggggg tttcactqtg 


tgcccgccat cacacccggc taatttttgt atttttagta gagacggggg tttcacfcgtg 


ttgaccaggc 

ttgaccaggc 
ttgaccaggc 


tggtcttgaa 
tggtcttgaa 
tggtcttgaa 


ctcctgacct 
ctcctgaccb 
ctcctgacct 


cgtgatccgc ecgcctcagc cccccaaagt 
cgtgatccgc ecgcctcagc ctcccaaagt 
cgtgatccgc ecgcctcagc ctcccaaagt 


ttgaccaggc tggtcttgaa ctcctgacct cgtgatccgc ecgcctcagc .ctcccaaagt 


gctgggatta 
gcLgggatta 
gctgggatta 


caggcttgag 
caggcttgag 
caggcttgag 


ccaccgcgcc 
ccaccgegce 
ccaccgcgcc 


cgacctgttt ccaccttttg gctcttgtga 
cgacctgttt ccaccttttg gctcttgtga 
cgacctgttt ccaccttttg gctcttgtga 


gctgggatta caggcttgag ccaccgcgcc cgacctgttt ccaccttttg gctcttgtga 


aaagtgttgc 
aaagtgttqc 
aaagtgttgc 


tctgaatatt 
tctgaatatt 
tctgaatatt 


tatgtacaag 
tatgtacaaq 
tatgtacaag 


gacLtgtttg agtcaotgtt ttcc^attggg 
qacttqtttg agtcactgtt ttccattgqg 
gacttgtttg agtcactgtt ttccattggq 


aaagtgttgc tctydatatt tatqtacaag gacttgtttg agtcactgtt ttccattggq 


ttataaactt 
ttataaactt 
tLataaacLL 


tgatgggaga 
tgatgggaga 
tgatgggaga 


gagcggqaga 
gagcgggaga 
gagcgggaga 


gcecgcqgte agtgtttgag ttgtcctctt 
gcacgcggtc agtgtttgag ttgtcctctt 
gcacgcggtc agtgtttgag ttgtcctctt 


ttataaactt tgatgggaga gagcgggaga gcacgcggtc agtgtttgag ttgtcctctt 


cagtcagcgt 
cagtcagcgt 
cagtcagcgt 


ctqcctttct 
ctgcctttct 
ctqcctttct 


tggctctccg 
tggctctccg 
tggctctccg 


tgagtcctct gagt^tggct tgcccgtgct 
tgagtcctct gagcgtggct tgcccgtgct 
tgagtcctct gagcgtqgct tgccegcgct 
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>Partl ot Ens.«12001 
>HCBI b2 9 #147(31 
>l9Cm old »14761. 

#14761 

>Partl of Ens. #12061 
>NCBI b29 #14 821 

>HCBI oirt #1 15 82 1 

#14821 

ipartl of Ens. #12 12 I 
>NCB1 oia #14681 

>Fari:l of Ens. #12181 
^NCBI b29 « 1-1 941 

>NCBI old #14941 

#14941 

>Partl of Ena. #12241 
>NCBI b29 #15001 
>KCBI-Dld #15001 

(JlSOOl 

>Psictl of Ens- #12301 
>NCEI b29 #15061 
>NCBI old #15061 

#15061 

>Partl Of Ens.#123«l 
iMCBI h29 #15121 
>MCBI Old #15121 

#1&121 


>Partl of Ens. #12421 
>HCBI b29 #15181 
>1>ICBI old #15181 

#15181 

>Partl of Ens.«l2481 
>NCBI bZg fll52'll 
>NCBI old » 15241 

«15241 

>partl of Ens. #12541 
>NCBT b29 #1530] 
>NCBI old #15301 

»15301 

>Partl of Cna.H 12601 
>NCBI bZ9 #15361 
>UCB1 Old #13361 

015361 


>Partl of Ens-«12661 
>NCBI b29 fil5421 
>NCBI old #15321 

#15421 

>Partl of Ens- #12721 
>WCB1 b29 #15481 
>NCBI old #15481 

ftl5481 

>Partl of Ens. #12781 
>WCBI b2 9 1115541 
>NCBI old #15541 


cagtcaqcgt ctqcccntct tggctctccg LgaijLcctct gcigcqtgqcl tgcccytgct 

gtctcccctc tgcaggaqga aaactccutc tkcgLcatga ccaacgtgat cctciaceatg 

gtctcccctc tgcaggagga aaactccctc ttcgtcatga ccaangtgat nntcaccatg 

gtctcccccc tgcaggagga aaactccccc ttcgtcatga ccaacgtgat cctcaccatg 


gtctcccsctc tgcaggagga aaactccctc ttcgtcatga ccaacgtgat cctcaccatg 

aaccagacac agggcctgtg ccccgaggta ggaggccccc gggoagagcc ccaggccccs 

aaccagacac agqgcctgtq ccccgaggta qgaggccccc gggaagaqcc ccagqcccca 

aacr:sgaeac sqgqcctgtq ccccgaggta gg^ggccccrc gggaagagcc ccaggcccca 


aaccagacac agggcctgtg ccccgaggta gqaggccccc gggaagagcc ccaggcccca 

caccoctetc cacgcctgtc eaci?tgtQtg tggggcc9gg ccacgtggac tttcttti.cg 
cacccctctc cacgcctgtc cacctgtgtg t-ggggcc^? ccac^tggac tttcttttcg 
cacccctctc cacgcctgtc cacc^gtgtg tggggccggq c<.»cgtggac: tttcttttcg 


cacccctctc cacgcctgtc cacctgtqtg tgggqccggg ccacgtqgac tttcttttcg 

ctccttcttt ttccagattc cagatgcgac cactgtgtgt. aaatcagatg ccagctgtao 
ctccttcttt ttccagattc cagatgcgac cactgtgtgt aaatcagatg ccagctgtac 
ctccttcttt ttccagattc cagatgcgac cactgtgtgt eiaatcagatg ccagctgtac 


ctccttcttt ttccagattc cagatgcgac cactgtgtgt aaatcagatg ccagctgtac 

tgccggctct gccggcaccc acagcaacgg tacgagcttg tggcetcctg gggagggcgg 

tqccggctct gccqqcaccc acagcaacgg tacgagcttg tggcctcctg gggagggcgg 

tgccggctc:^ gccggcaccc acagcaacgg tacgagcttg tggcctcctg gggaggqcgg 


tgccggctct gccggcaccc acagcaacgg tacgagcttg tggcctcctg gggagggcgg 

CCCCtgagca gatcgccccc actgtggagc gtctctgata gagaaatctt cccaattcct 
cccctgagca gatcgccccc ^ictgtggagc gtctctgata gagaaatctt cccaattcct 
ucrcctgagca gatcqccccc actgtggagc gtctctgata gagaaatctt cccaattcct 


cccctgagca gatcgccccc actgtggagc gtctctgata gagaaatctt ccicaattccL 

tcacatgacc ctggqtgagc caggtgccga ggctgqqgtc ctqqagcccc tctacattca 
tcacatgacc ctgggtgagc caggtgccga gqctggggtc ctggcsgcccc tctacattca 
tcacatgacc ctgggtgagc caggtgccga ggctggggtn ctggagcccc tctacattca 


tcacatgacc ctgggtgagc caggtgccga ggctggggtc ctggagcccc tctacattca 

ctgctgtcat tggagcccca caagccatcc cagctcttgc cctactgttt ttttttttgt 
ctgctgtcat tggagcccca caagccatcc cagctcttgc cctactgttt ttttttttgt 
ctgctgtcat tggagcccca aaagccattzc cagctcttgc cctactgttt ttttttttgt 


ctgctgtcat tggagcccca caagccatcc cagctcttgc cctactgttt ttttttttgt 

tgttgttgtt gttttttctt gttttttttt ttgttttgtt tttttgtttt tttgagccag 
tgttgttgtt gttttttctt gttttttttt ttgttttgtt tttttgtttt tttgaqccag 
tgttgttgtt gttttLtctt gttttttttt ttgttttgtt tttttgtttt tttgagccag 

tgttgttgtt gttttttctt gttttttttt ttgttttgtt tttttgtttt Lttgagccag 

agtctcgctc tgtcacccaa actggagtgc aatggcatqa tctoqgctca ctgcaacctc 
agtctcgctc tgtcacccaa actggagtgc aatggcatga tctcggctca ctgcaacctc 
agtctcgctc tgtcacccaa actggagtgc aatggcatga tctoggctca ct-gcaacctc 


agtctcgctc tgtcacccaa actggagtgc aatggcatga tctcggctca ctgcaacctc 

caccfccccag tttcaaacaa ttctcctgcc tcagcctccc gagtagctgg gattacaggc 

cacctcccag tttcaaacaa ttctcctgcc tcagcctccc gagtagctgg gattacaggc 

cacctcccag tttcaaacaa ttctcctgcc tcagcctccc gagtagctgg gattacaggc 


cacctcccag tttcaaacaa ttctcctgcc tcagcctccc gagtagctgg gattacaggc 


atgcaccacc: 
atgeaccacc 


acg cccaqct 
acgcccagct 
acgcccagct 


aatttttgta 
aatttttgta 
aatttttgta 


<Tt t ttggtag 
Gttttgqtaq 
cttttggtaq 


agatgggttt 
agatggqttt 
agatgggttt 


taccatgttg 
taccatgttg 
taccatgrrg 


atgcdccacc dcgcccayct aatLtLLgUa ctLLtggtag agatgggttt taccatgttg 

gccaqgctgg tctcgaactc ctgacetcag gtgatccacc cgcctcagcc tcccaaagtg 

gccaggctgg tcteiqaactc ctgacetcag gtgatccacc cgcctcagcc tcccaaagtg 

gccaggctgg tctcgaactc ctgacetcag gtgatccacc cgcctcagcc tcccaaagtg 


gccaggctgg tctcgaactc ctgacetcag gtgatccacc cgcctcagcc tcccaaagtg 

ctaggattac aggcatgacc cactgngcct ggcctcttgt cccattcttt agcttggcat 

ctaggattac aggcatgacc cactgtgct--t ggcctcttgt cccattcttt agcttggcat 

ctaggattac aggcatgacc cactgtgcct ggcctcttgt cccattcttt agcttggcat 
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ctaqqattac aqqcatgacc cact^tgcct ggcctcttqt i 


>Parcl of Er\s.#128<!l 
>NCBI b29 #15601 
>NCBI Old #15601 


>Partl of EnG.t»129[)l 
>HCBI b29 « 15661 

>KCBI old #15661 


>Partl of Ens. #12961 
b29 #15121 
>HCBI old »15-??1 


>Partt of Ens. 313021 
>NCBI t>29 »157S1 
>NCBI old #15T81 


^ Parti of Ens. « 13081 
>MCSI b29 fll5841 
>HCBI old #15841 


>Fartl of Ens. 813141 
>NCBI b29 « 15901 

>11CB1 old 1115901 


>Parcl of Ens.«13201 
>NCBI b29 #15961 

>i!iCBi old fiisgei 


>Partl of Ens. #132 61 
>NCBZ b29 #16021 
>NCSI old #16021 


>Partl of Em. #13321, 
>NC:bi b2 9 #16081 
>NCBI old #16082 


>Partl of Ens.#133Bl 
>1!ICBI b29 #16H1 
>NCB1 old #16141 


>Partl of En.s.# 13441 
>KCBI b2 9 #16201 
>NCBI old #16201 


sPartl of Ens. #13501 
>NCB3 b2 9 #16261 
>MCBI uid #16261 


>Partl of: Ens. #13561 
>NCBI b2 9 #16371 
>iaCBI Old #16321 


>Parta of Ens.ttl3621 
>NCBI 629 fl 16381 

>HCBI Did if 16381 


caccctgqct 
caccctggct 
caccctggct 


gagatgtggc 
gagatgtggc 
gagatgcggc 


tggcacacag 
tggcacaca? 
tggcacacag 


gtaactgtct 
gtaactgtct 
gtaactgrct 


tcctccqatt 
tcctccgatt 
tccteegatt 


cuaactcctg 
ctaactcctg 
ccaactcctg 


caccctggct gagatgtggc cggcacacag gtaactgtct tccteegatt ctaactcctg 


gacagtgaca 
gacagtgcica 
gacagtgsca 


ctgtctaaat 
ct.gtc;La«iat 
ctgtctaaat 


cccagatttg 
CL:i:agattt.g 
cccagatttg 


cacagctcaa 
uacagctcaa 
cacagctcaa 


gactggctgc 
gactggctqc 
gactggctgc 


atgggsggct 
atgggaqgct 
aCgggaggct 


gacagtgaca ctgtctaaat cccagatttg cacagctcaa gactggctgc atgggaggct 


ggatggggct 
ggatggqqct 
gqatggggct- 


ctcactccct 
ctcactccct 
ctcactccct 


ggtagaaaat 
qqtagaaaat 
ggtagaaaat 


aqgagacccc 

agqagacccc 


tgtggacatg 
tgtggacatg 
tgtggacatg 


ggatggggct ctcactccct actccaa. 


ggtagaaaat aggagscccc tgtggacatg 


ggacccccct 
ggacccccct 
ggaccx.-ccct 


gccacccttg 
gccacccttg 
gccacccttg 


tgcttgtagg 
tgcttgtagg 
tgcttgtagg 


agtctcaac^a 
agtctcaaca 
agtctcaaca 


ggcaggtycg 
ggcaggtgcg 
ggcaggtgcg 


tagctLtcaa 

tagctttcaa 
tagctLtcaa 


ggacccccct gccacccttg tgcttgtagg agtctcaaca ggcaggtgcg tagctLtcaa 


cgggtctgtc 
cgggtctgtc 
cgggtctgtc 


aagacgtgtg 
aagacgtgtg 
aagacgtgtg 


aggtggcggc 
aqgtggcggc 
aggtggcggc 


ctggtgcccg 
ctggcgcccg 
ctggtgcccg 


gtggagqatg 
gtggaggatg 
gtggaggatg 


acacacacgt 
acacacacgt 
acacacacgt 


cgggtctgtc aagacgtgtg aggtggcggc ctggtgcccg gtggaggatg acacacacgt 


gccacagtga 
gccacagtga 
gccacagtga 


qtccagccct 
gtccagccct 
qtccagccct 


agggaagqaa 
agggaaggaa 

agggaagqaa 


gtgccttttt 
gtgccttttt 
gtqcc-tttt 


gttttqtttt 
gttttgtttt 
gttttqtttt 


gttttagaca 
gttttagaca 
gttttagaca 


gccacagtga qtccagccct agggaagqaa gtgccttttt gttttqtttt 


cagtttcact 
cagtttcact 
cagtttcact 


ctgtatc-cc 
ctgtatc-cc 
ctqtatcgcc 
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tgccttggcc tcccaaagtg 
tgccttggcc tcccaaagtg 
tgccttggcc tcccaaagtg 


tagccaggat ggtctcgatc tcctgacctc gtgatccacc tgccttggcc tcccaaagtg 


ctgqgattgc acgcgtgagt 
ctgqgattgc acgcgtgagt 
ctgggattgc acgcgtgagt 


caccacgccc ggccctgaat 
caeca cgccc ggccctgaat 
caccacgccc ggccctgaat 


atttattatg aaaasciagaa 
atttattatg aaaaaaagaa 
atttattatg aaaaaaagaa 


ctgggattgc acgcgtgagt caccacgccc ggccctgaat atttattatg aaaaaaagaa 


tgtataatat ctcaLLgatt 
tgtataaLat ctcaLLgatt 
tgtataatat ctcaLLgatt 


tLLqtatqtt ggttacatgt 
tLtqtatqtt ggttacatgt 
ttLgtatgtt ggttacatgt 


tgaagtaata ttttgggtta 
tgaagtaata ttttgggfcta 
tgaagtaata ttttgggtta 


tgtataatat ctcattgatt tttqtatgtt. gqttacatqt tqaagtaata ttttgggtta 


ctgtataaaa ttttacttgt 
crgtataaaa ttttacttgt 
ctgtataaaa ttttacttgt 


ttcttttact ttgtagtqtg 
ttcttCtact ttgtagtgtg 
ttcttttact ttgtagtgtq 


qctaLtaqqa aactttaagt 
gcLattaqqa aactttaaqL 
qcLattaqqa aactttaaqL 


ctgtataaaa ttttacttgt ttcttttact ttgtagtgtg gcLattagga aactttaagL 


tggagtgtgg cttgtagttg 
tggagtgtgg cttgtagttg 
tggagtgtgg cttgtagttg 


tgtctcataa gatacttttt 
tgtcteataa gatacttttt 
tgtctcataa gatacttttt 


:tttgacaca ggatcttgct 
itttgacaca ggatcttgct 
:tttgacaca ggatcttgct 


tggagtgtgg cttgtagttg tgtctcataa gatacttttt ttttgacaca ggatcttgct 


ctgtcaccca gtgcagtgat 
ctgtCBCCca gtgcagtgat 
Ctgtcaccca gtgcagtgat 


aacgatcatg gctcactgca 
aacgatcatg gctcactgca 
aacgatcat<j gctcactgca 


gcttcgaact cctggactca 
gcttcgaact cctggactca 
gcttcgaact cctggactca 


ctgtcaccca gLgcegtqaL aacgatcatg gctcactgca gcttfrgaaet cctggactca 


agtgatcttc ccgcctcatu 
agtgatcttc ccgcctcacc 
agtgatcttc ccgcctcatc 


ctcctgtgta qctgggacca 
ctcctgLgta qcLqqgacca 
ctcctgtgta gctgggaeca 


caggggcgca ccaccatgcc 
caggggcgca ccsccatgcc 
caggggcgca ccaccatgcc 


agtgatcttc ccgcctcatc ctcctgtgta gctgggaeca caggggcgca ccaccatgcc 


CFigccaactt attttttqta 
cptgccaactt attttttqta 
eagccaactt attttttgr.a 


gagacagggt ctcaccatgt 
qagacagggt ctcaccatgt 
gaganagqqt ctcaccatgt 


tgcccaggct qqtcttyaac 
tgcccaqgct ggtcttgaac 
tgcccaqgct gqtcttgaac 
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tcctgggctc 
tcctgggct.c 
tcctgggctc 


aiigcdaccca cctgccttati cctcccaaag 
nagca^iccca cctgccttag cctccicaaay 
aagcaacecfi cctgcnt.taQ cct. crccaaag 


tqctqqqatt cicagcqtqaq 
tycLgqyatt acagtrgtqag 
tgctoggatt acagcgtgag 


tcctgggctc aagcaaccca cctgcctLag ccttjtrcaaag LgctgggatL acagcgtgag 


ccaccacacc Lggcctcaca atatactttt attggacatc 

ccaccacacc tggcctcaca atatacttLL attggcii:at.c 

ccaccacacc tggcctcaca atatactttt atcggacacc 

ccaccacacc tggcctcaca atatactttt dUtggacaLc 

gtgtaaaatq atttttaagt gcracractgat ataagaaata 

gtgtaaaatg acmtfcaagt gcacactgat ataagaaata 

gtqtaaaatg atttttaagt gcaeact^aC araagaaatri 


actgttgcag atctaatgcg 
acLgktgcag atctaatgcg 
actgttgcag atctaatgcg 

actgttgcag atctaatgcg 

acattggcca qgcacag 

tgg 

acattggcca ggcacagtgg 
acattggcca ggracagtgg 


gt.gtaaoatg atttttaaqt gcncactgiat ataagaaata acattggcca ggcacagtgg 


cLcatg-ctg 
ctcratg-ctq 
ctcatgccta 


taatccLaac acLttqqqag qctqaggcag 
taatcctaac actttgggag gctgaggcag 
taatcccagc actttgggag gcagagacgg 


tcggatcatt tgaggccagg 
L^ggatcatt tgaggccagg 
gtggatcact tgaggccagg 


ctcatgcctc Caatccyarc autttgggag gcwgagccjrg Icyggatcayt tgaggccagg 


agttcaaatt 
agttcaaaLL 
agttcaaaac 


cagcctggc4? aacatggtga aaccccctct 
cagccLggcc aacatgqtga aaccccctct 
caqcctggcc aacatggtga aaccccctct 


rrtaetaaana tataaaaatt 
ctactaaaaa tataaaaatt 
c:Cactaaaaa tataaciaatt 


agttcaaawy caqcctggcc aacatggtga aaccccctct ctactaaaaa tataaaaatt 


agccaggcgt 
agccaggcgt 
agccaggcgt 


ggtggtgcgc acrrtgtaatc ccactactca 
ggtggtgcgc acctgtaatc ccactactca 
ggtggtgcgc acctgtaatc ccactactca 


ggaggctgag gcacgagaat 
ggaggctgag gcacgagaaC. 
ggaggctgaq gcacgagaat 


agccaggcgt ggtggtgcgc acctgtaatc eeael:aetca ggaggctgag gcacgagaat 


tgcttgaacc 
tq'cttgsacc 
tgcttgaacc 


cgggagatag aagttgcagt gagccadgst 
cgggsgatag aagttgcagt: gagccaagat 
cgggagatag aagttgcagt gagccaagat 


caLgccactg cgctccagcc 
catgccactg cgctcaagcc 
catgccactg cgctccagcc 


tgcttgaacc cigggagatag aagttgcagt gagccaagat catgccactg cgctccagcc 


cgggtgacag 
cgggtgacag 
cgggtgacag 


aataagaccc tgtctcaaaa acagaacaaa 
aataagaccc tgtctesaaaa acagaacaaa 
aataagaccc tgtctcaaaa acagaacaaa 


gactaaaacg ttttggsact 
gactaaaacg ttttggaaut 
gactaaaacg ttttggaact. 


cgggtgacag aataagaccc tgtctcaaaa acagaacaaa gactaaaacg ttttggaact 


agacagaggt 
agacagaggt 
agacagaggt 


gatgattgca caacattat:g aatgtactaa 
gatgattgca caacattaLg aatgl.actaa 
gatgattgca caacattatg aatgtactaa 


atgccacaga attgtccagt 
atgccacaga attgtccagt 
atgccacaga attgtccagt 


ayacagagyt gatgattgca caacattatg aatgtactaa atgccacaga attgtccagt 


ttaaaatggt 
ttaaaatqgt 
ttaaaatqgt 


taartttatg rtatgtgaat ttcatctcaa 
taattttatq ttatgtqaat ttcatctcaa 
taaCLttatg ttatgtgaau ttcatctcaa 


tttctctaaa aaggtgagtt 
tttctctaaa aaggtgagtt 
tiLCLCtaaa aaggtgagtt 


ttaaaatggt taattttatg ttatgtgaat ttcatctcaa tttctctaaa aaggtgagtt 


gatcttaaag 
gatcttriaag 
gatcttaaaq 


tattacatgt gaggtacaca ggtgtgtgtg 
tartacatgt gaggtacaca ggtgtgtgtg 
tattacatgt gaggtacaca ggtgtgtgtg 


tgggtgttac ggtacaggaa 
tgggtgctac ggtacaggaa 
tgggtgttac ggt-.acaggaa 


gatcttaaag tattacatgt gaggtacaca ggtgtgtgtg tgggtgttac ggt^icaggaa 


at-gacaaaaa 
atgacaaaaa 
atgacaasaa 


cttgaacacc ccLgttgtaa gccatccctc 
cttgaacacc cctgttgtaa gccatccctc 
cttgaacacc cctgttgtaa gccatccctc 


ttgagggagt ggqqactttq 
ttgaggqaqt ggggactttg 
ttgagggagt ggggactttg 


atgacaaaaa cttgaocacc cctgttgtaa gccatccctc ttgagggagt ggggactttg 


aagacctgag 
aagacctgag 
aagacctgag 


agaagtccag gcac^gtggc ccatgcctgt 
agaagtccag gcacagtggc ccatgcctgt 
agaagtccag gcacagtggc ccatgcctgt 


aatcccagca ctttgggaga 
satccuagca ctttgggaga 
oatcccagca ctctgggaga 


aagacctgag agaagtccag gcacagtggc ccatgcctgt aatcccagca ctttgggaga 


ccgaggtggg 
ccgaggtggg 
ccgaggtggg 


tggatcatct gaggtcagga gtttgagacc 
tggatcatct gaggtcagga gtttgagacc 
tggatcatct gaggtcagga gtttgagacc 


agcctgacca acntggtgaa 
agcctgacca acatggtgaa 
agcctgacca acatggtgaa 
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ccgaggtggg tggsitcatct qaay tc-ciyya qtlLtfayacc aycctgacca dcatqqtgcia 


accccatctc 
accccatctc 
accccatctc 


Bccccatctc tactaaaaat atgaaa^aa 


tactaaaaat atgaaaaaat 
tacraaaaat atgaoaaaat 
tactsaaaat atgaoaaaat 


tagccaggcg 
tagccaqgcg 


tqgtqgtgca 
tggtggtgco 
tggtggtgca 


cgcctgtaac 
cgcctgtaac 
cgcctgtaac 


tafj[7:caqgcg tggtggtgca cgcctgti 


cccaqctaot 
cccaqctiict 
cccagctact 


Cqgqaggctg agqcaqgaqa 
cgggaygctg aggcacigaqa 
cqggaggctg agguaggaga 


atcacttgaa 
atcacttgaa 
atcacttgaa 


cccagaggtt 
cccaqaggLt 
cccagaggCt 


gcaqtgagcc 
gcaqtgagoc 
gcagcg<<gc?c 


cccagctact cgggaggctg aggcaggaga atcacttgaa cccagaggtt gcagcgagcc 


gattgcgg 
gagattgcgg 
gaqatcgcgg 


cattgcactc cagcntgagc 
cattgcactc cagcctgagc 
cattgcactc cagectgage 


aariagqagca 
aacaggagca 
aacaggagc-a 


agactccatc 
agactccatc 
agactccatc 


gagatygcgg cattgcactc cagcctgagc aacaggagca agactccatc tcaaaaaaaa 


aaaaaacctg 
aaaaaacctg 
aaaaaacctg 


agagaagccc cttggtccca 
sgagaagccc cttggtccca 
agagaagccc cttggtccca 


gcctttctct 
gcctttctct 
gcctttctct 


gacagcagtg 
gacagcagtg 

gacagcagtg 


gtgacaggct 
gtgacaggct 
gtgacaggct 


:tg agagaagccc 


ttggtccca gcctttctct gacagcagtg gtgacaggct 


cagctctcct 
cagctctcct 
caqctctcct 


ccgagtgcag ncctgt car:t 
ccgagtgcag ccctgtcact 
ccqaqtgcaq ccctgtcact 


ggcctrgctc 
ggccr tgctc 
gaccttgctc 


ctgtctcagg 
ctgtctcagg 
ctgtctcagg 


gr:tgggaaga 
gctgggaaga 
gctgggaaga 


cagctctcct ccgagtgcag ccctgtcact grccttgctc ctgtctcagg gctgggaaga 


ctgttgatgt 
Ct^ttgatgt 
Ctgttgatgt 


tgtcatCcca aaqatcccac 
torccattcca aagatcccac 
tgtcattcca aagatcccac 


ctggabcogg 
ctggaccagg 
ctggatcagg 


ggaacatccc 
ggaacatccc 


ccacagaagg 
ccacagaagy 
ccacagaagg 


ctgttgntgt tgtcattcca aagatcccac ctggatcagg ggaaca^ccc ccacagaagg 


gttagccata 
gttagccata 
gttagccata 


pagtgccoga ttctceagga 
cagtgccaga ttctccagga 
cagtqccaga ttctccagga 


gaaattcace 
gaaattcacc 
gaaattcacc 


aaagnaatgg 
aaagaaatgg 
aaagaaatgg 


agtcceettg 
agtccccttg 
agtcceettg 


gttagccata cagtgccaga ttctccagga gaaattcacc aaagaaaLgg agtccccttg 


gggacagatt 
gggacagatt 
gggacagatt 


caacttgtat tgtcagccag 
caacttgtat tgtcagccag 
caacttgtat tgtcagccag 


gagctgacgt 
gagctgacgt 
gagctgacgt 


ggcacttctg 
ggcacttctg 
ggcacttctg 


agaagaggcc 
agaagaggcc 
agaagaggcc 


gggacagatt caacttgtat tgtcagccag gagctgacgt ggcacttctg agaagaggcc 


qqcgcdiCCtg 
qqcgctscctg 
gguqt^acctg 


ctqgcqgqtg ctttqtqcac 
ctggcqqqtg ctttqtqcac 
ctqgcgqqtq ctttqtquac 


ttttcagaca 
ttttcagaca 
ttttcaqaca 


ggtcaggatc 
'ggtcaggatc 
ggtcaggatc 


cagcctgtag 
cagcctgtag 
cagcctgtag 


ggcgcacctg ctggcgggtg ctttgtqcac ttttcagaca gyLcaggatu cagcctgLag 


geaaar.ttac 
gcaaatttac 
gcaaatttac 


trctgctttg acctgtaaaa 
ttttgctttg acctgtaaaa 
ttttgctttq acctgtaaaa 


ccggatctgc 
ccggatctgc 
ccggatctgc 


ccagccttca 
ccagccttca 
ccagccttca 


ttctttccct 
ttctttccct 
ttctctcect 


gcaaatttac ttttgctttg acctgtaaaa ccggatctgc ccagccttca ttetytecct 


ggagaacgcc 
gqagaacqcc 
ggagaacgcc 


tgcggcccca aagccaggcc 
tgcggcccca aaqccaggcc 
tgcggcccca aagccagycc 


tactgatttc 
tactgatttc 
tactgatttc 


cagtgaggcc 
cagtgaggcc 
cagtgaggcc 


acaaatcccc 
acaaatcccc 
acaaatcccc 


ggagaacgcc tgcggcccca aagccaggcc tacLgatttc cagtgaggcc acaaatcccc 


tccctggtta 
tccctggtta 
tccctggtta 


gcaattcagt tttcaccgcc 
gcaat-.tcagt tttcaccgcc 
gcaattcagt tttcaccgcc 


ttqggggaga 
ttgggggaga 
ttgggggaga 


gggcctgccc 
gggcctgccc 
gggcctgccc 


ttgcttgagg 
ttccttgagg 
ttccttgagg 


tccctqgtta gcaattcagt tttcaccgcc ttgggggaga gggcctgccc ttccttgagg 


. ggaaggctcg qctcagtqtc 
ggaaggctcg gctcagcgtc 
. qgaaggctcg gctcagtgtc 


ttctcttgga 
Ltctcttqga 
ttctcttgga 


agaqaagaqt 
aqagaagagt 
agagaagagt 


gtgcatgcag 
gtgcatqcaq 
gtgcatgcag 


gqangaacct ggaaggctcg gctcagtgtc ttctcttgga agagaagaqt gtgcatgcag 


aagggtgtag 
aagggtgtag 
^agqqtgtag 


aaaatgctag acgtgtttca 
aaaatgctag acgtgtttea 
aaaatgcLag acgtgtttea 


tcttcttgac 
tcttcttgac 
tcttcttgac 


aaaatgacat 
aaaatgricat. 
saaatgacat 


tgtaagatgt 
tgtangatgt 
tgtaagatgt 
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aagggtgtag aaaatgctag acgtgtttca tcttct;tqac aaaatgacaC tqtaaeyiztst 


gtgtatatgt rtttaaaarit 
gtglaLatgt Lt-tLaaaadt 
gtgtatatgt ttttaaaaat 


.Tttacatagy 
attacatagg 
attacatacfq 


ggctqaacat 
ggctgaacat 
cjgctgaaeat 


(jgtggccKac 
ggtggcccac 
gqtgqcccac 


tcctqtagci: 
tccfegtagtc 
tcctgtaqtc 


attacatagg ggntgnacat ggtggccnan tcccgtcTgtr; 


c^gccJoaggg agycggaggt 
caqcacaggg aggcggaggt 
cagcacaggg aggcgQsggr 


aggaagattg 
aggaagattq 
agqaaqatzg 


cttgaggtca 
cttgaggtca 
cttgaggtca 


ggagttcaag 
ggagttcaag 
ggagt tciiag 


accagccLyg 
aecagcctqg 
accagcccgg 


cagcacaggg agguggaggt aggaagalLg cttgaggtca ggagttcaag accagcctyg 


gcaatigtagg aagacttcat 
gcaatgtagg aagacittcat 
gcaatgtagg aagacttcat 


gtctacaaaa 
gt-.ctacaaaa 
gLctacaaaa 


aaaatttcat 
aaaatttcat 
aaaatttcBt 


tagccaqgca 
t-agccaggca 
tagccagqca 


gcaatgtagg aagacttcat gtctacaaaa i 


.atttcat taqccaggca 


tggtgatgca tgcctatagt 
tggCgatgca tgcctatagt 
tggtgatgca tgcctatagt 


tccagctact 
tccayctact 
tccagctact 


tgagaggctq 
tyijgagqctg 
tgagaggctg 


aggtgggaga 
agqtgggaga 


occccctgag 
acuccctgaq 
accccctgag 


tijgtgatgca ngcctat^gt tccagctact. tgagaggctg aggtgggaga accccctgag 


cctgggaggt 
cctgggaggt 
cctgggaggt 


cgaggctaca 
cgaggctac-a 
cgaggctaaa 


agaagcLqtg 
agaagctgtq 
agaagctgtg 


:tt^gtgcaac 
:tcgtgcaac 
rt.catgcaac 


tqtactccaq 
tgtactccag 
tgtactttcag 


cctgygcaae 
cctqgqcaac 
cctgggcaac 


cctgggaggt ugaggctaca agaag<jtgtg ttcxtgcaac Lgtactccag cctgggcaac 


cctgtctcaa 
oetgtctcaa 
vctgtctcaa 


aaatal:atat 
aaacaratat 
aaataLatat 


aqgctgggcg 
aggetgggcg 
aggctgggcg 


tgatgqctta 
tgatggcr.ta 
tgatggctta 


tgcctgtaae 
tgcctgtaat 
tgcctgtaat 


agegcaagac cetgtcccaa aaaea-c'atat aggctgggog tgatggctta tgcctgtsat 


cccagnactt 
cccagcactt 
cccagcactt 


tgggaggctg 
tgggaggctg 
tgggaggctg 


aagcaggcgg 
aagcaggcgg 
aagcaggcgg 


atcacttgag 
atcacttgag 
atcacttgag 


gccaggagtt 
gccaggagtt 
gccaggagtt 


ggagaccagc 
ggagaccagc 
ggagaccagc 


cccagcactt tgggaggctg aagcaggcgg atcacttgag gccaggagtt ggagaccagc 


cLgaccaaca 
ctgaocaaca 
ctgaccaaca 


tggcggaacc 
tggcggaacc 
tggtqgaacc 


ctgtctctac 
ctgtctctac 
ctgtctct ac 


aaaaaagtta 
aaaaaagtta. 
aaaaaagtta 


accaggcata 
accaggcata 
accaggcata 


ctgaccaaca tggyggaacc ctgtctctac taaaaataca aaaaaagtta accaggcata 


qcggcecaca 
gtggctcaca 
gtggct caca 


cctgtgtaat 
cctgtgtaat 
cctgtgtaat 


cccagctaca 
cccagctaca 
cccagctaca 


cgggaggctg 
cgggaggctg 
cgggaggctg 


aggtggeagt 
aggtggcagt 
aggtggeagt 


gagctgagat 
gagctgagat 
gagctgagat 


gtggctcaca cctgtgtaat cccagctaca cgggaggctg aggtggcagt gagctgagat 


cgcgccattg 
cgcgccattg 
eg eg ccattg 


cact caagcc 
cactcaagcc 
cactcaagcc 


tgcacjtcgag 
tgcactcgaq 
tgcactcgacj 


gLgacagagL 
gtgacagagt 
gtgacagagt 


gactccatct 
gactccatct 
gactccatct 


caaaaaa-aa 
c-aaaaaaaa 
caaaaaaaa^ 


attg cactcaagcc tgcautcgag gLgacagagL gactccatct caaaaaaaaa 


aaaaaaaata 
aaaaaaaata 


tatatatata 
tatatatata 
tatatatata 


tatatacaca 
tatatacaca oacacacaca 
tatatatata 


cagacac — a 
cagacacaca 
cacacayaca 


carncaa tac 


aaaaaaaawa tatatatata tatatayaya cacacacaca casacaaaca cacacasyac 


aatacatats tatatgtgtg 

a — tata tatatgtgtg 

aatacatata tatatgtgtg 


tgtatatata 
tgtatatata 
tatatatata 


tatataacat 
tatataacat 
tatataacat 


ggttatgtga 
ggttatgtga 
ggttatgtga 


gtatttgtat 
gtatttgtat 
gtatttgtat 


aatacatata tatatgtgtg trtatarata tatcitaacat ggttatgtgn gtatrtgtat 


gtattttatt aacagtaatg 
gtatttcatt aacagtaatg 
gtatetcatt aacagtaatg 


ttatcattgt: 
ttatcattgt 
ctatcattgt 


ttttcagacc 
ttttcagacc 
ttttcagacc 


tgc^Lttttta 
tgctttttta 
tqctttttta 


aaggctgcag 
aaggctqcaq 
aaggctqesg 


attgL ttttcagacc tgctttttta aaggctgcag 


aaaacttcac tcttttqgtt . 
aaaacttcac tcttttqgtt . 
aaaacttcac ccttttggtc 


tctggtaccc 
tctggtaccc 
tctggtarcc 


caaatttaat 
caaatttaat 
caaatttaat 


ttcagcsaqt 
ttcagcaa<;t 
ttcagcaagt 
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#22141 
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#22201 
#22201 


•3124 

#22261 

•22261 


aagtggtgyc 
eagtggtggc 
aagtggtrjgc 


caggtgtgtg 
caggtgLgtg 
Cciggtgtgtg 


agtLcaccag 
ogttcaccag 
agtrc:accag 


g<]tcttggag 
gatcttggag 
ggtcttggag 


aaacttctgg 
aaactt:ctg9 
a^acttctgg 


ctcttcttjtc 
ctcttctctc 
ctcttctctc 


aagtggtggc caggtgtgtg agtttjaccag ggtcttggag aaacttctgg ctcttctctc 


ttctgrtqagq 
LLctctgagg 
ttctctgagg 


ttttcqtcgc 
LttLcqLcgc 
ttttcgtcgc 


tctgattttc 

tctqatLLtc 

tctgattttc 


tgcttcctct 

tgcttcctct 

tgcttcctct 


cgactttagg 
cgactttagg 

cgactttagg 


aqgaaLatcc 
aggaatatcc 
aggaatafcc 


ttctctgagg ttttcgtcgc tcLgattttc tgcttcctct cgactttagg aggaatatcc 


ttecf^aa<^at 
ttcccaacat 
ttccc^acat 


cannactact 
caccactact 
caccdctact 


t^icctcaagr 
tacctcaagt 
caoctcaagc 


cgtgcatrta 
cgtgcattta 
cgtgcattca 


tgatgctaaa 
tyatqctaaa 
Lgatgctaaa 


acagatccct 
dCaqatcccL 
acagatccct 


ttcccaacat caccactact tacctcaagt cgtgcattta tgatgctaaa acagatccct 


tctgcccvat 
tctgcCQCQt 
tctgccccat 


attccgtctt 
attccgtntt 
attccgtctt 


ggcaaaatag 
ggcaanatag 
ggcaaaatag 


tggagaacgc 
tgga^aacgc 
tggagaacgc 


aggacacagt 
aggacacagt 
aggacacagt 


ttccaggaca 
ttccaggaca 
ttccaggaca 


tctgccccat attccgtctt ggcaaaatag tggagaacgc aggacacagt ttccaggaca 


tgqccgtqga 
tggccgtgga 
tggccgtgga 


gqtgqqtgcq 
ggtgggLgcg 
ggtgggtgcg 


gqccctqqct 

ggccctggct 
ggccctggct 


ctcctqaccc 
ctcctgaccc 
ctcctgaccc 


agccctqgag 
agccctggag 
agccctggag 


gogtctcqtg 
gcgtctcgtg 
gcgtctcgtg 


tggccgtg^a ggtgggtgcg ggccctggct ctcctgaccc agccctggag gcgtctcgtg 


ccaggtgctg 
ccaggtgctg 

ccaggtgctg 


a^aaagcct 
aggaaagcct 

aggaaagcct 


tgcegtgtct 
tgccgtgtct 
tgccgtgtct 


ccgctgctca 
ctgctqtctca 
ctgcLgctca 


tf^ocaggga 
tccccaggga 

tccccaggga 


ggcatcatgg 
ggcatcatqg 
g<;c3kcatgg 


ccaggtgctg aggaaagcct tgccgrgtct ccgctgctca tccccaggga ggcatcatgg 


gcatccaggt 
gcatccaggt 
gcat ccaggt 


caactggyac 
caactgggac 
caactgggac 


tgcaaccLgg 
tgcaacctgg 
tgcaacctgg 


jcag^gccgc 
icogagccgc 
icagagccgc 


ctccctctgc 
ctccctctgc 
ctccctctgc 


tLgcccaggt 
ttgcccaggt 
ttgcccaggt 


gcatccaggt caactgggac tgcaacctgg acagagccgc ctccctctgc ttgcccaggt 


actccttccg 
actccLtccg 
actccttccg 


ccgcctcqat 
ccquctcgat 
ccgcctcgat 


acaogggacg 
acacgggacg 
Bcacgggacg 


ttga«rcacaa 
ttga^cacaa 


cgtatctcct 
egtatctcct 
cgtatctcct 


ggctacaatt 
ggctacaatt 
ggcta 


actccttccg ccgcctcgat acacgggacg ttgagcacaa cgtatctcct ggctacaatt 


tcaggtgggc 
tcaggtgggc 
tcaggtgggc 


gtgagcttgg 
gtgagcttgg 
gtgagcttgg 


geccctcgct 
gcccctcgct 
gcccctcgct 


catgttgtag 
catgttgtag 
catgttgtag 


ggggtgergg 
ggggtgctgg 

ggggtgctgg 


tggecgcgta 
tggctqcgta 
tggctgcgLa 


tcaggtgggc gtgagcttgg gcccctcgct catgttgtag ggggtgctgg tggctgcqt, 


cgtgccagCg 
cgtgccagtg 
ogtgccagtg 


ggccguccac 
ggccgcccac 
ggccgcccac 


tgaagaccag 
tgaagaccag 
tgaagaccag 


cactcaggca 
cactcaggca 
cactcaggca 


gca^cccaag 
gcaccccaag 
gcaccccsag 


ggcaggctgc 
ggcaggctgc 
ggcaggctgc 


cgtgccagtg ggccgcccac tgaagaccag cactcaggca gcaccccaag ggcaggctgc 


eggteccccg 
c^gtcccccg 
eggteccccg 


tccaaggcgg 
tccaaggcgg 
tccaaggcgg 


cgggaaggcc 
cgggaaggcc 
cgggaaggcc 


atgctgggaa 
atgc tgggaa 
atgctgggaa 


aatgccctta 
aatgocctta 
aatgccctta 


gtggtgtcct 
gtgg tgtccL 
gtggtgtcct 


eggteccccg tccaaggcgg cgggaaggcc atgctgggaa aatgccctta gtggtgtcct 


gctccggggc 
gctccggggc 
gctccggggc 


catcccggcc 
catcccggcc 
catcc cggcc 


cccgagaccc 
cccgaqaccc 
cccgagaccc 


ctccttgcce 
ctccttgccc 
ctccttgcce 


tttcctgccg 
tttcctgccg 
tttcctgccg 


caagaaacat 
caagaaacat 
caagaaacat 


gctccggggc catcccggcc cccgagaccc ctccttgcce tttcctgccg caagaaacat 


gttgagatgg 
gttgagatgg 
gttqagatgg 


ttcttagagc 
ttcttagagc 
ttcttagagc 


agccaggaga 
agcoaggaga 
agccaggaga 


agctgggggc 
agctggggge 
agctgggggc 


ttaagctctc 
ttaagctttc 
ttaagctttc 


cagcacctgc 
cagcacctge 
cagcacctgc 


gttgagatqg ttcttagagc agccaggaga agctgggggc ttaagctttc cagcacctgc 


ctcsgccatg 
ctcagccatg 
ctcsgccatg 


acctccatte 
acctccattc 
acctccacrc 


actgcctcaa 
actgcctcaa 
actgcctcaa 


ggagcggatg 
ggagcggatg 
ggagcggatg 


atcttgtgat 
atcttgtgat 
atcttgtgat 


eetccaqtcc 
cctccagtcc 
ccteeagtcc 
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aaagqcccct 
aaaggcctct 


ggggcctggc 
gqggnctggc 
gqggcctggi= 


ccaggaattg 
crraggaantg 
ccaggaatrg 


gtttctcaaa 
gtttcccaaa 
gcttctfiriaa 


ggtcgaacct 
ggttgaacct 
ggctgaacct 
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gfcfjncagaat 
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ctccagagtg 
ctcc^agagtg 
ctccaqagcg 


casgggacac 
caggggacac 


aggtntgagg 
Bggt-.ctgagg 
aggtctgagg 


cccctgaaag 
cccctgaaag 

cccctgaaag 


gccctqctgg 
gccttgrjtgg 
gccttgctgg 


gttccctgrjt 
gttccct-gct 
gttccctgct 


ctccagagtg cagggyacac aggtctgagg cccctgaaiig gccttgcLyg gttccctgct 


gcaaatqcrg 
gcaaatgctg 
gcdaatgctg 


gcatctriggt 
qcatctcggt 
tjcatctcggt 


gtcctggggc 
gtcctgggge 
gtcctggggc 


agatgtaagc 
agatgtaagc 
aqatgtaaqc 


gagtcccaca 
gagtcccaca 
gagtcccaca 


aacaqtqgcf 
aacagtggct 
aacaqtgqct 


gcaaatgctg gcatctcggt gtcctggggc agatgtaagc gagtcccaca aacagtggct 


tgcaactaca 
tqcaactaca 
tgcaactaca 


ggacttgatt 
ggacttgatt 
ggactbgatt 


ctctcacagt 
ctctcacaqt 
ctctcacdg t 


tctcacagtt 
tctcacagtt 
tctcacagtt 


atii gaggcca 
Ctagaqgcca 
Ctaga<igcca 


caacctaaaa 
caiicctaaaa 


tgcaacnaca ggacttgatr ctctcacagt tctcacagtt ctagaggcca caacRtaaaa 


tccaggcatc 
tccagqcatc 
Leeway qcatc 


igcagggctg 
igcaqgqctg 
lycagggcty 


tgctcgctct 
tqctcgctct 
tqctcgcLct 


gaaggctctg 
qaaqgctctq 
qaaggutctg 


gagagagtcc 
qagagaqtcc 
gagiagaqtcc 


ttacttgtct 
ttacttqtct 
ttacttgtct 


tccaggcatc agcagggctg tqctcgctct gaaggctctg gagagagtcc ttacttgtct 


ctcctggctt 
Ctcc-Lggctt 


ctggtagccc 
ctggtagccc 
uLggtagccu 


acggtgttcc 
acqqtgttcc 
acggtgttcc 


ttggcttgtg 
ttqqcttgtg 
ttggcLLgtg 


gatgcatcac 
gatgcatcac 
gatguatcac 


tccagtctct 
tccagtctct 
tc:cagtctct 


Ctcctggctt ctggtagccc acggtgttcc ttggcttgtg gatgcatcac tccagtctct 


gcctctcctg 
gcctcLcctg 
gcctctcctg 


tcatgaggcc 
tcaLgagqcc 
tc;atgaggcc 


ttctgccttq 
ttctgccttg 
ttctgcctty 


tacgtgtctc 
tacgtgtctc 
tatgtgtctc 


tgtqtctttt 
tqtgtcLttt 

tgtgtctttt 


cttcttgtaa 
cttcLtgtaa 
cttcttgtaa 


gcctctcf^tg tcatgaggci 


rtgccttg taygtgtctc tgtgtctttt ettcttgta 


ggatatcagL 
ggatatcagt 
ggatatcagt 


cactgaaLtt 
cactgaattt 
cactgaattt 


agggcccacc 
agggcccacc 
agggcccacc 


ctactctagt 
ctactctagt 
ctactctagt 


atgacctcaL 
atgacctcat 
atgacctcat 


cttaacaadt 
cttaacaaat 
Gttaacaaat 


ggatatcagt cactgaattt agggc^reacc ctactctagt atgacctcat cttaacaaat 


tacgtctgcc 
tacgtctgcc 
tacgtctgcc 


aatatcccat 
aatatcccat 
aatatcccat 


ttccaaata^ 
ttccoaataa 
ttccoaataa 


gqtcacactc 
ggtuacactc 
ggtcacaccc 


ccaagcttca 
ccaagcttca 
ccaagcttca 


ggtagatqtg 
ggtagatgLg 
ggcagatgtg 


tacgtctgcc aatatcccat ttccaaataa ggtcacactc ccaagcttca ggtagatgtg 


aa LLLttggg 
aatttttggg 
aattttCggg 


gacacactgL 
gacacactgt 
gacacaccgt 


tcaacccact 
tcaacccact 
tcaacccBCt 


acagcLgatg 
acagctgatg 
acagctgatg 


atcLcatgat 
atctcatgat 
otctcatgat 


cagaatctat 
csagaatctat 
cagaatctat 


aatttttggg gacacactgt tcaacccact acagctgatg atctcatgat cagaatctat 


agccaggtaa 
agccaggtaa 
agccaggtaa 


attLccacgL 
atttccacgt 
atttccacgt 


accctttgtt 
accctttgtt 
accctttgtt 


gcattLttca 
gcatttttca 
gcQtttttca 


cagtccttaa 
cagtccttaa 
cagtccttaa 


aaaggaagcg 

aaaggaagcg 
aaaggaagcg 


agccaggtaa atttccacgt accctttgtt gcatttttca cagtccttaa aaaggaagcg 


ctccaggccg 
crccaggccg 
ctccaggccg 


ggcgcagtqa i 
ggcgcagtga i 
ggcgcagtga i 


: acLttgggag 
: actttgggag 
: actttgggag 


yccaaggcgg 
gccaaggcgg 
gccaaggcgg 


Ctccaggccg ggcgcagtga ctcacacctq taatcctagc actttgggag gccaaggcgg 


gcagatcacc 
gcagatcacc 
gcagatcacc 


tgaggccagc 
tgaggccagc 
tgaggccagc 


atggcgaaac 
atggrgaaac 
atggcgaaac 


cccatctcta 
cccatctcta 
cccatctcta 


ctaaaaaaaa 
ctaaaaaaaa 
ctaaaaaaaa 


gcagatcaec tgaggccagc atggcgaaac cccatctcta ctaaaaaai 


agctgggcat 
agctqqgcat 
aqctggqca t 


ggcggcgcat 
qgcqgcgcat 
ggtggcgcat 


gcctgtaaec 
qcctgt?»aCc 
qcctqtaatc 


ccagcaactt 
ccagcaactt 
ccagcaactt 


gggaggctga 
gggaggctga 
gggaggctga 


tacaaaaatt 
tacoaaaatt 
tacaaaaatt 

tacaaaaatt 

ggcaggagaa 
ggcaggagaa 
ggcaggagaa 
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aacrgggcac ggyggcgcat gcctgcaatc ccagcaactt gggaggccqa ggcaggagaa 

tciUJLtLqaqc cagggaggca gaggtcgcdg Lgagccguga tcgcaccgcc gcaccccagc 
tcgcttgagc cogggoggca gaggttgcag tgagccgaga tcgcactgct gcoctccagc 
tcgcttgagc cagggaggca gaggttgcag tgagccgaga tcgcactgct gcactccagc 


tcgcttgagc cagggaggca gaggttgcag tgagccgaga tcgcactgct gcoctccagc 

ctqqgcgaca aaqcaagtct ccqtctcaqg gggaaaaaaa agaggccctg caaggaccat 

ctgggcgaca aaqcaagtct ccgtctcagg gggaaaaaaa agaggccctg caaggaccat 

ctgggcy^ca aagt_-a^gtct CL-ytctcagg gggaaaaaaa agaggccctg caaggaccat 


c^tgggcgaca aagcaagtct ccgtctciagg gggaaaaaaa agaggccctg caaggaccat 

ccctgggcgc ggtcogcqgt gct.ttgggck gcacttgcct ceccctagtg gtgacgtgtg 

ccctgggcgc ggtcagcggt gctttg^gct goacttycct cccctrLagtg gtgacgtgtg 

ccctgggcgfc ggtcaqcggt gctttgggcc gcacctgcct ceccctagtg gtgacgtgtg 


ccctgggcgc ggtc^gcggt gctttgggct gcacttgcct cccccnagtg gtgacgtgtg 

gaatccgatic catacgggat cctaagaccr. ctgtaattgt qggatctgct gtccttttga 
gaatccgatc catacgggat cctaagacct ctgtaattgt gggatctgct gtccttittga 
gaatccgatc catacgggat cctaagacct ctgtaattgt ggganctgct gtccttttga 


gaatccgatc catacgggat cctaagacct ctgtaattgt gggatctgct gtccttttga 

accLLctggc tcactgagta aagcegatiiL ttgaattttt cttttctagc agttagattt 
accttctggc tcactgagta aagcagatat ttgaattttt cttttctagc agttagattt 
accttctggc tcactgagta aagcagatat ttgaattttt cttttctagc agttagattt 


accttctggc tcactgagta aagcagatat ttgaattttt cttttctagc agttagattt 

gtgttattcc atctctactg tcacacactt cttgtggaaa cctcaagctt aggtcagagc 
gtgttattcc atctctactg tcacacactt cttgtggaaa cctcaagctt aggtcagagc 
gtgttattcc atctctactg tcacacactt cttgtggaaa cctcaagctt aggtcagagc 


gtgttattcc atctctactg tcacacactt cttgtggaaa cctcaagctt aggtcagagc 

cctaaaacgg aaagacctca ggacgcctat cccccaacct cgggcagcac caggccctgc 
cctaaaacgg aaagacctca ggacgcctat cccccaacct cgggcagcac ceggcccrtgc 
cctaaaacgg aaagaactca ggacgcctat cccccaacct cgggcagcac caggccctgc 


cctaaaacgg aaagamctca ggacgcctat cccccaacct cgggcagcac caggccctgc 

ctgtgtctcc caggcctggc ccatggcgtg gaagttgtgg gccagataga gcatggagct 
ctgtgtctcc caggcctggc ccatggcgtg gaagttgtgg gccagataga gcatqgaqct 
ctgtgtctcc caggcctggc ccatggcgtg gaagttgtgg gccagataga gcatgqagct 


ctgtgtctcc caggcctggc ccatggcgtg gaagttgtgg gccagataga gcatggaqct 

agggtgcggg ccgaggctcc tgatggatcc ctggagaaca tggggcagca gatggtatgt 
agggtgcggg ccgaggctcc tgatggatcc ctggaqaaca tggggcagca gatggtatgt 
agggtgcggg ccgaggctcc tgatggatcc ctggagaaca tggggcagca gatggtatgt 


agggtgcggg ccgaggctcc tgatggatcc ctggagaaca tggggcagca gatggtatgt 

cttgacggbt cLtcaccaatj catccatc:cc ctgctcaLgc gtgtgtgctc tctcgcgtgc 
cttgacggtt cttcaccaac catccstccc ctgcttiatgc gtgtgtgctc tctcgcgtgc 
cttgacggtt rrttcaccaac; catccatr:cc ctgctcatgc gtgtgtrgctc tctcgcgtgc 


cttgacggtt cttcaccaac catccatccc ctgct{3atgc gtgtgygctc tctcgcgtgc 

teqctctctc tctctctcat atatqtatat gtatatatqt qtatatatat acatatataq 
tcgctctctc tctctctcat atatgtatat gtatatatgt gtatatatat atatatatag 

tcgctctctc tctctctcat atatgtatat gtatatatgt g--t — atat atatatatag 


tcgctctctc tctctctcat atatgtatat gtatatatgt gtatatatat atatatatag 


ccattatata 
ccattatata 
ccattatata 


tataqccatt 
tatagccatt 
tatagccatt 


atatatatat 
atatatatat 
atatatatat 


atattttttt 
atattttttt 
at-ttttttt 


ttcttcttct 
ttcttcttct 
tttttcttct 


t ttttaaggg 
tttttaaggg 
tttttaaggg 


ccattatata tatagccatt atatatatat atattttttt ttyttcttct tttttaaggg 

agtgacttgc tctgtcaccc aggttggagt gcagtggtgc gatttcagct cactgcagcc 

agtgactLgc tctgtcaccc aggttggagt gcagtggtgc gatttcagct cactgcdgcc 

agtgacttgc tctgtcaccc aggttggagt gcagtggtgc gatttcagct cactgcagcc 


agtgacttgc tctgtcaccc aggttggagt gcagtggtgc gatttcagct cactgcagcc 

tttgcctcoc gggttcaaga cattctccLg cctcagccgc ctgagtagct gggattacag 
tttgcctccc gggttcaaga cattctcctg cctcagccgc ctgagtagct gggactacag 
tttgcctccc gggttcaaga cattctcctg cctcagccgc ctgagtagct gggattacag 
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tt-tgcctccc gggt.Lcaaga cdt Lctcctg cctr^agccgc ctgagtaqcL gggattacag 

gcrgcccgoca ccar:acccag ctaatttttt gtatttttat tagagariggg gtttcbccac 

gogcct^gccs cciicacccag c:taattttLt gLatttLLat tagagiicggg gttLctcoac 

gcgcccgcca ccacacccag ctaatttttt gtatttttat tagagacggg gtttctccac 

gcgcccgcca ccacacccag ctaattttct gtatttttat tagagacqqg gttCccccac 

attggccagg gtggtctcga gctcctcacn tcaagtgatc cacctgcctc gqcctccuiiij 

sttqqccagg gtggtctcga gctccucacc tcaagtgatc cacctgcctc ggcctcccaa 

attggccagg gtggtctcga gctcctcacc tcaagtgatc cacctgcctc ggcctcccaa 

attggccagg gtggtctcga gctcctcacc tcaagtgatc cacctgcctc ggcctcccaa 

agtgccggaa ttacaggtgt gagccaccgc acccagccat atatttttaa atggtcttac 

agtgccggaa ttacaggtgt gagccaccgc acccagccat atatttttaa atggtcttac 

i^gtgccggaa ttacaggtgt gagccaccgc acccsgccat atatttttaa atggtcttac 

agtgccggaa ttacaggtgt gagccaccgc acccagccat atatttttaa atggtcttac 


tgagatgcaa 
tgagatgcaa 
tgagatgcaa 


ttcacatgcc 
ttcacotgcc 
ttcacatgcc 


ataccgttca 
ataccgt-tca 
ataccgttca 


cgcatttgaa gtgcagaatt 
cgcatttgaa gtgcagnatt. 
cacatttgaa gtgcagaatt 


catagttttc 
catagt-.tttc 
catagttttc 


tgagatgcaa ttcacatgcc ataccgttca crcatttgaa gtgcagaatt catagttttc 


actatattca 
actatattca 
actatattca 


cagataagtg 
cagataagtg 

cagacaagcg 


caggcttcac 
caggcttr:ac 
caggcttcac 


cacagtcgat tttagaacat 
cacagtngat tttagaacat 
cacagtcgat tttagaacat 


ttccgtcatc 
ttccgtcatc 
ttccgtcatc 


actatattca cagataagtg caggcttcac cacagtcgat tttagaacat ttccgtcatc 


tcaaaaagaa 
tcaaaaagaa 
tcaaaaagaa 


accctgcacc 
accctgcacc 
accctgcacc 


cttctgtcat 
cttctgtcat 
cttctgtcat 


cccccggccc ctggcaacag 
cccccggccc ctggcaacag 
cccccggccc ctggcaacag 


tggtctactt 
tggttrL^ctt 
tggtctactt 


tcaaaaagaa accctgcacc cttctgtcaL cccccgqucc ctggcaacag tggtctactt 


tctgtctctg 
tctgtctctg 
tctgtctctg 


cagatgtgcc 
cagatgtgcc 
cagatgtgcc 


tcttctgagc 
tcttctgagc 
tcttctgagc 


atttcatat:a atcaat<3tgt 
atttcatata atcaatatgt 
atttcatata atcaatatgt 


ggccttfctgc 
ggccctttgc 
ggccttttgc 


tctgtctctg cagatgtgcc tcttctgagc atttcatata atcaatatgt ggccttttgc 


g tctggctcc 
gtctggctcc 
gtctggctcc 


ttccccttag 
ttccccttag 
ttccccttag 


cctgaLgttt 
cctgatgttt 
cctgatgttt 


tcadgcttca tccatgtcgt 
tcaagctCca tccatgtcgt 
tcaagcttca tccatgtcgt 


agcctgtgtc 
agcctgtgtc 
agcctgtgtc 


gtctggctcc ttccccttag cctgatgttt tcaagcttca tccatgtcgt agcctgtgtc 


agtacctcgt 
agtacctcgt 
agtacctcgt 


ctctttttat 
ctctttttat 
ctctttttat 


ggccaaatca 
ggccaaatca 
ggccaaatca 


tattctgctg tgttgacaqa 
tattctgctg tgttgacaga 
tattctgctg tgttgacaga 


cgacgttttg 
cgacgttttg 
cgacgttttg 


agtacctcgt ctctttttat ggccaaatca tattctgctg tgttgacaga cgacgttttg 


ttcatctctt 
ttcatctctt 
ttcatctctt 


tatctgtgtg 
tatctgtgtg 
tatctgtgtg 


acgaacgagt 
acgaacgagt 
acgaacgagt 


qgttcttttt gttgttgttg 
ggttcttttt gttgttgttg 
ggttcttLtt gttgttgttg 


ttgttgttga 
ttgttgttga 
ttgttgttga 


ttcatctctt tatctgtgtg acgaacgagt ggttcttttt gttgttgttg ttgttgttga 


agactgtgtg 
agactgtgtg 
cigactgtgtg 


cacagaatca 
cacag^atca 
cacagaatca 


atgtttcttt 
atgtttcrtt 
atgtttcttt 


gaaaaoataa acaaatctaa 
gaaaaaataa ataaatttaa 
gaaaaaataa ataaatttaa 


aaagccctag 
aaagccctag 
aaagccctag 


agactgtgtg cacagaatca atgtttcttt gaaaaaataa ataaatttaa aaagccctag 


gtcaagagat 
gtcaagagat 
gtcaagagat 


tgagaccgtc 
tgagaccgtc 
tgagaccatc 


atggccaact 
atgyccaact 
atggccaact 


tggtgaaacc ccgtctctae 
Lggtgaaacc ccgtctctae 
tggtgaaacc ccgtctctae 


taaaaaatcc 
taaaaaatcc 
taaaaaatcc 


gtcaagagat tgagaccrtc atggccaact tggtgaaacc ccgtctctae taaaaaatcc 


aaaaattagc 
aaaaattagc 
aaaaattagc 


tgggc;3tggt 
tgggcatgqt 
tgggcatggt 


ggtgtgcgcc 
ggtqtgcqcc 
ggtgtgcQcc 


tgtagtccca gctactcggg 
tqtagtccca gctactcgqg 
tgtagtccca gctactcggg 


aggctgaggc 
aggctgaggc 
aggctgaggc 


aaaaattagc tgggcatggt ggtgtgcgcc tgtagtccca gctactcggg aggctgaggc 

agaagaattg cttgaactgq qgaggtgqag gttgeagtga gccaaqattq gtgcccctac 

ogaagaattg cttgaactgg qgaggtggag gttgeagtga gccaagattg gtgcccctac 

agaagaattg cttgaactgg ggaggtggag gttgeagtga gccaagattg gtgcccctac 
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agaagaaUtg cttgaactqg ggagqtggag gttgcagtga gccaagatLg gtqcccctac 

actccagcct ggcaacagag gqagactcca tctcaaaaaa aaaaaaaaaa aaaaagcccc 
acuccagcct ggcaacagag ggaqactcca tctcaaaaaa aaaaaaaaaa aaaaagtccc 
actoKagcct ggcaacagag ggagactcca tct.caaaaaa aaaaaaaa-- aaaaagtccc 

actccagcct ggcaacagag ggagactcca LcLudddaaa aaaaaaaaaa aaaaagtccc 

tgLTdctqatq ctqtgttqqg gctggct.tag tcccctcctg caatgctggc tggtcacatt 
tgcactgatq ctqtgttqqg gctggcttag tcpcctcctg caatgctggc tggtcacaLL 
tgcoctgatg ctgtgttggg gctggcttag tcccctcctg caatgctggc tggtcacatt 

tgcactgatg ctgtgttggg gctggcttag tcccctcctg caatgctggc tggtcacatt 

cttactgtta gatgttggag gccagggtcc cttgagggag gcrdygcagga tgtaccaggt 
cttactgtta gatgttggag gtcagggtnc cttgagggag gcaggcagga tgtaccaqqt 
Cttactgttii gatgttggag gtcagggtcc cttgagggag gcaggcagga tgtaccaggt 

cttactgtta gatgttggaq gtcagggtcc cttgagggag gcaggcagga tgtaccaggt 

gggatgttga gagcaaactg ttcaggttca ggagaggaca ctggtgctgg aggaggaggt 
gggatgttga qagca^^act.g ttcaggttca ggagaggaca ctggtgctgg aggaggaggt 
gyyiitgttqa gagcaaactg ttcaggttca ggagaggaca Ctggtgctgg aggaggaggt 

gggatgttga gagcaaactg ttcaggttca ggagaggaca ctggtgctgg aggaggaggt 

r:t-.ccctgtgc ccctgtacct cgtgggcccc gcatcctcct gcttgcacag cccgcctcag 
ctccctgtgc ccctgtacct cgtgggcccc gcatcctcct gcttgcacag cccgcctcag 
ctccctgtge eeetgtacct cgtgggcccc gcatcctcct gcttgcacag cccgcctcag 

cLccctgtgc ccctgtacct cgtgggcccc gcatcctcct gcttgcacag cccgcctcag 

ccaggagagg ccccgagccg ctccagcgtc cattcagcog gcagagtgga ccattcacct 
ccaggagagg ccccgagocg ctccagcgtc cattcagccg gcagagcggs ccattcacct 
ccaggagagg ccccgagccg ctccagcgtc cattcagccg gcagagtgga ccattcacct 


;caggagaqg ccccgagccg ctccagcgtc 


lattcagccq gcagagtgga rrcattcacct 


gtgccagctc cactctaang ttctctcaca gggcccaagg tcctgcccca gccatctccc 
gtgccagctc cactctaacg ttctctcaca gggcccaagg tcctqcccca gccatctccc 
gtgccagctc cactctaacg ttctctcaca gggcccaagg tcctgcccca gci.;atctccc 

gtgccagctc cactctaacg ttctctcaca gggcccaagg tcctgcccca gccatctccc 

cctgatccat eetecttccc cccaggtttg ccaagtacta cagagacctg getggeaacg 
cctgatccat eetecttccc ctcaggtttg ccaagtacta cagagacctg gctggcaacg 
cctgatccat eetecttccc ctcaggtttg ccaagtacta cagagacctg gctggcaacg 

cctgatccat cctccttccg ctcaggtttg c-caagtacta cagagacctg gctggcaacg 

agcagcgcac gctcatcaag gcctatggca tccgcttcga catcattgtg tttgggaagg 
agcagcgcac gctcatcaag gcctatggca tccgcttcga eateattgtg tttgggaagg 
agcagcgcac gctcatcaag gcctatggca tccgcttcga cateattgtg tttgggaagg 

agcagcgcac gctcatcaag gcptatggea tccgcttcga cateattgtg tttgggaagg 

tagctcgccg ccactggctc ccctccgtca ctccctgeag ggacaagggg cctctccctg 
tagctcgccg ccautgqotc ccctccgtca ctccctgeag ggacaagggg cctctccctg 
tagctcgccg ccactggctc ccctccgtca ctccctgeag ggacaagggg cctctccctg 

tagctcgccg ccactggctc ccctccgtca <jtt;L;ctgcaq ggacaagggg cctctccctg 

cccctgcaqa aacacttttt ttctttttcg gtgtcttggc aggcagggaa atttgacatc 
cccctgcaga aacacttttt ttctttttcg gtgtcttggc aggcaggg^ii* atttgacatc 
cccctgcaga aacacttttt ttctttttcg gtgtcttggc aggcagggja atttgacatc 

CCCCtgcaga aacacttttt ttctttttcg gtgtcttggc aggcagggaa atttgacatc 

atccccacta tgatcaacat cggctctggc ctggcactgc taggcatggt gagtggttta 
atccccacta tgateaaoat cggctctggc ctggcactgc taggcatggt gagtggttta 
atccccacta tgatcaacat cggctctggc ctggcactgc taggcatggt gagtggttta 

atccccacta tgatcaacat cggctctggc ctggcactgc taggcatggt gagtggttta 

qgccctgcct tcaccctcac ggtgagqtga gaccctgqgc tggggtcctg gtcctggccc 
qqccctgcf^t tcaccctcac ggtgaggtga gaccctgqgc tggggtcctg qtcctggccc 
ggccctgcct tcaccctcac ggtgaggtga gaccctgggc tggggtcctg qtcctggccc 

ggccctgcct tcaccctcac ggtgaggtga gaccctgggc tggggtcutg gtcctggcce 

taggccctag acctcagaLg tgtttctaaa cttgaccctc ctaccttctt tcccttggcc 
taggccctag acctcagatg tgtttctaaa cttgaccctc ctaccttctt tcccttggcc 
taggccctag acctcagatg tgtttctaaa cttgaccctc ctaccttctt tcccttggcc 
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taggccctag acctcagatg tgtttctaaa cttg««:cte ctaccccctt tccr.ttgtjcc 


cccagccctc ctcrcacctt 
cccaqccetc cccccacctc 
ccKaqcectt^ ctcccacctt 


cccttctcca 
cccttctcca 
cccttctcca 


aqaccacccc 
agaccacccc 
agaccacccL: 


cctcaatftcc 
cctcaggccG 
cctcagqtc'c 


cagccttctc 
cagcctcccc 

.cagccLtiJt-c; 


-ccttctcca Hgaccacccc cr:tcaqgtcc (Cagccttctc 


ccoagagatg ggagtgcctt 
ccaagagatq godgtgcctt 
ccaagaga-tg gaagV.gcctt 


cccattccgg 
t[jcattccqq 
tccattccqg 


taaagattcc 
Laaaqattcc 
taaagattcc 


aggcttctca 
aggcttctca 
aggcLLctca 


ggaaggggca 
gyaagggqca 
ggaaggggca 


ccaagagatg ggagtgcctt tccattccgg taaagattcc aggcttctca ggaaggggca 


cgcaaagaac aagatgggtt 
cgcaaagaat aagatgggtt 
cgcaaagaat aagatgggtt 


gatgggttgc 
gattiggttgc 
gatgggttqc 


aagcatcctg 
aaguaLcctg 
aagcatcctg 


gctcactctc 
gctcactctc 
gctcactctc 


accctatgct 
sccctdtgct 
accctatgct 


cgcaaagaat aagatgggtt ga-tggerttgc aagcatcctg gctcactctc accctatgct 


aaactcaggc gaccgtqctg 
aaactcaggc gaccgtgctg 
aaactcaggc gaccgtgctg 


tgtgacatca 
tgtgacatca 
tgtgacatra 


tagtcctcta 
tagtcctcta 
tagtcctcta 


ctgcatgaag 
ctgcatgaag 
ctgcatgaag 


aaaagacLct 
aaaagactct 
aaaagactct 


aaactcaggc gaccgtgctg tgtgacatca tagtccTcta ctgcatgaag aaaagactct 


actatcggga gaagaaatat 
actatcggfja gaagaaataf 
actatcggga gaagaaatat 


aaatatgtgg 
aaatatqtgg 
aaatatgtgg 


aagattacga 
aagattacga 
aagattacga 


gcsggtaggc 

gcaggtaggr: 
gcaggtaggc 


ccctcctggc 

ccctcctggc 
ccctcctggc 


actatcggga gaagaaatat aaatatgtgg aaqattacgd gcaggtaggu ccctcctgqc 


ccccagcagg cacaggcctc 
ccccagcagg cacaggcctc 
ccccagcagg cacaggcctc 


tcatctcttg 
tcatctcctg 
tcatctcttg 


ggtgtggfjag 
ggtgtgggag 
ggtgtgggag 


czcct gggc;gt 

ccctgggcgt 
ccctgggcgt 


gggcctgtot 
gggcctgrct 
gggcctgtct 


ccccagcagg cacaggcctc tcatctcttg ggtgtgggag ccctg<)9cgt gggcctgtct 


ggggaggccc ttctgcagag 
ggggaggccc ttctgcagag 
ggggaggccc ttctgcagag 


gctggcacca 
gctggcacca 
gctggcacca 


gtgtggcgtg- 
gtgtggcgtg 
gtgtggcgtg 


gtgtccccgt 
gtgtccccgt 
gtgtccccqt 


taacccgggc 
taacccgggc 
taacccqqgc 


ggggaggccc ttctgcagag gctggc; 


gtgtggcgtg gtgtccccgt taacccgggc 


agtcctgcca ctctcagcag 
agtcctgcca ctctcagcag 
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ccagaaaagctgggagcagatgctgggaagaaaacaacggatgtttgctacagtgtattt 
taaactaaactagacaggttcatcctggcgcgg 
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DIAGNOSING PREDISPOSITION TO FAT 
DEPOSITION AND THERAPEUTIC METHODS 

FOR REDUCING FAT DEPOSITION AND 
TREATMENT OF ASSOCIATED CONDITIONS 


CROSS-REFERENCE TO RELATED 
APPLICAnONS 

[0001] This application claims benefit of U.S. Serial No. 
60/386,012 filed Jun. 4, 2002. The contents of that applica- 
tion is incorporated herein by reference. 

FIELD OF THE INVENTION 

[0002] The invention relates to genetic alterations in 
nucleic acids that are associated with fat deposition and 
diabetes. 

BACKGROUND 

[0003] Individuals who are obsese have excess body fat 
compared to set standards. Obesity can be determined by 
several methods including body mass index (BMI) measure- 
ments, weight-for height charts, and body fat measurements 
determined by skinfold thickness and bioelectrical imped- 
ance. Obesity affects 58 million people across the United 
States, which represents approximately one-quarter to one- 
third of the adult population, and its prevalence is increasing 
to epidemic proportions in the United States and in other 
industrialized nations. 

[0004] Recognized since 1985 as a chronic disease, obe- 
sity-related medical conditions contribute to approximately 
300,000 deaths each year, second only to smoking as a cause 
of preventable death. (JAMA, 276: 1907-1915 (1996)). 
Obesity has been established as a major risk factor for type 
II diabetes melitus, hypertension, cardiovascular disease and 
some cancers in both men and women (JAMA, 282: 1523- 
1529 (1999)). Other comorbid conditions include sleep 
apnea, osteoarthritis, infertility, idiopathic intracranial 
hypertension, lower extremity venous stasis disease, gastro- 
esophageal refiux and urinary stress incontinence. 

[0005] The total cost attributable to obesity amounted to 
$99.2 billion in 1995. Approximately $51.65 billion of those 
dollars were direct medical costs. The cost of obesity to U.S. 
business in 1994 was estimated to total $12.7 billion, and 
health-related economic costs of obesity to businesses in the 
United States is substantial, representing approximately 5% 
of total medical care costs. (American Journal of Health 
Promotion, 13 (2): 120-127 (1998)). It was found that as 
BMI increases, so do the number of sick days, medical 
claims and health care costs and that the mean annual health 
care costs for the BMI "at risk" population was $2,274 
versus $1,499 for the "not at risk" group. 

[0006] An accumulation of adipose tissue on the trimk and 
around the waist, known as central fat, also confers an 
increased risk of type 11 diabetes and cardiovascular disease 
(Lundgren et al.. Int. J. Obes., 13(4): 413-23 (1989); Ohlson 
et dil.. Diabetes, 34(10): 1055-8 (1985)). In addition, central 
obesity has been implicated in a condition known as the 
metabolic ssmdrome (or syndrome X), which is associated 
with increased risk of cardiovascular disease, vascular 
dementia, and diabetes. Tlie metabolic syndrom is a descrip- 
tive term for the coexistence of aU of the following or 
differing combinations of central fat, hypertension, glucose 


intolerance, dyslipidemia (elevated triglycerides and low 
HDL cholesterol), and impaired insulin stimulated glucose 
uptake ("insulin resistance"). Prevalence of central fat and 
its relationship to general obesity differs between ethnic 
groups and gender (McKeigue et al., Diabetologia, 35(8): 
785-91 (1992); McKeigue et al.. Lancet, 337(8738): 382-6 
(1991)). A majority of male subjects having high central fat 
are also obese in terms of BMI, and obese subjects often 
have a central distribution of fat, which suggests an overlap 
between these two conditions. While this relationship is not 
as strongly correlated in women, central fat increases after 
menopause. 

[0007] Current anti-obesity therapeutics (e.g., Phenter- 
mine, Sibutramine, and Orlistat) are largely ineffective and 
there is an urgent need to define the etiology of this disease 
and initiate rational, mechanism-based drug development. 
Mouse QTL and human studies have postulated that the 
12q22 to q23 region, and specifically the insulin-like growth 
factor 1 (IGF-1) gene in that region, play a role in body 
weight regulation and viseral fat deposition (Collins, A. C. 
et al., Mam/n. Genome, 4: 454-458 (1993); Sun, G. et aV.Jnt. 
J. Obes., 23: 929-935 (1999); Keightley, P. D. et al.. Genet- 
ics, 142: 227-235 (1996). Also, other studies have linked 
obesity with certain portions of the human genome (Perusse, 
L. et al., Obesity Research, 9: 135-169 (2001); Chagnon, Y. 
C. et al., Obesity Research, 8: 89-117 (2000)). Specifically, 
the CD36L gene on chromosome 12 was implicated in 
plasma lipid levels and with BMI (Acton, S. et al., Arterio- 
scler. Tkromb. Vase. Biol, 19: 1734-1743 (1999)), the 12q24 
chromosomal region was postulated as playing a role in 
obesity ia a Quebec Family Study (Perusse, L. et al.. 
Diabetes, 50: 614-621 (2001)), and it was reported that 
certain polymorphic loci on chromosome four are associated 
with obesity (Stone et al,, American J Hitman Genetics, "A 
major predisposition locus for severe obesity at 4pl5-pl4," 
June 2002). 

SUMMARY 

[0008] It has been discovered that polymorphic variations 
in a gene encoding a purinergic receptor known as P2X4, 
which is located on chromosome twelve, have been associ- 
ated with central fat deposition. Thus, featured herein is a 
method for diagnosing predisposition to fat deposition or 
leanness in a subject, which comprises detecting the pres- 
ence or absence of one or more polymorphic variations in a 
P2X4 nucleotide sequence in a nucleic acid sample from a 
subject, where the P2X4 nucleotide sequence is set forth as 
SEQ ID NO: 1 or a substantially identical nucleotide 
sequence thereof. In certain embodiments, polymorphic 
variations at positions 11030, 15847, or 17338 in SEQ ID 
NO: 1 are detected. 

[0009] Also featured herein are nucleic acids that encode 
a P2X4 polypeptide and include one or more polionorphic 
variations at positions 11030, 15847, or 17338 of SEQ ID 
NO: 1, and oligonucleotides which hybridize to those 
nucleic acids. Also featured are methods for identifjdng 
candidate therapeutic molecules that reduce fat deposition 
and treat related disorders, as well as methods of reducing 
fat deposition and treating related disorders in a subject by 
administering such a therapeutic molecule. 

[0010] In addition, it was discovered that a polymorphic 
variation at position 15847 was associated with type II 
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diabetes (non-insulin dependent diabetes mellitus, or 
NIDDM) in subjects. Therefore, featured herein is a method 
for diagnosing predisposition to NIDDM in a subject, which 
comprises obtaining a nucleic acid sample from the subject 
and detecting the presence or absence of a polymorphic 
variation in a P2X4 nucleotide sequence associated with 
NIDDM. Also featured are methods for identifying candi- 
date therapeutic molecules for treating NIDDM in a subject 
and methods for treating NIDDM in a subject by adminis- 
tering such a therapeutic molecule. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] FIGS. lA-lL depicts the P2X4 nucleotide 
sequence reported as SEQ ID NO:l. The following nucle- 
otide representations are used throughout: "A" or "a" is 
adenosine, adenine, or adenylic acid; "C" or "c" is cytidine, 
cytosine, or cytidylic acid; "G" or "g" is guanosine, guanine, 
or guayUc acid; "T"' or "t" is th3niiidine, thymine, or 
thymidylic acid, and 'T" or "i" is inosine, hypoxanthine, or 
inosinic acid. Exons are indicated in italicized lower case 
type, introns are depicted in normal text lower case type, and 
polymorphic sites are depicted in bold upper case type. The 
sequence designates polymorphic ^tes at positions 2878, 
11030, 15847, and 17338, where a GGGCCCC nucleotide 
sequence insert was identified within positions 2878 to 2891 
in a minority of samples (not shown in the figure), and where 
single nucleotide polymorphisms (SNPs) are indicated at 
positions 11030, 15847, and 17338. SNPs are designated by 
the following convention: "R" represents A or G, "M" 
represents A or C; " W" represents A or T; " Y" represents C 
or T; "S" represents C or G; "K" represents G or T; "V" 
represents A, C or G; "H" represents A, C, or T; "D" 
represents A, G, or T; "B" represents C, G, or T; and "N" 
represents A, G, C, or T. There are three transcription 
products and two translation products that can be expressed 
from the nucleotide sequence as it is a dicistronic sequence. 
The first transcription product is encoded by a nucleic acid 
that starts at position 3027 and ends at position 36419, where 
exon 1 begins at position 2719 and ends at position 3160, 
exon 2 beings at position 9997 and ends at position 10144, 
exon 3 begins at position 14776 and ends at position 14847, 
exon 4 begins at position 14957 and ends at position 15029, 
exon 5 begins at position 15810 and ends at position 15906, 
exon 6 begins at position 21398 and ends at position 21478, 
exon 7 begins at position 21590 and ends at position 21731, 
exon 8 begins at position 21828 and ends at position 21964, 
exon 9 begins at poi^tion 25286 and ends at position 25379, 
exon 10 begins at position 25483 and ends at position 25548, 
exon 11 begins at position 25869 and ends at position 25964, 
and exon 12 begins at position 26395 and ends at position 
26616. The second transcription product is encoded by a 
nucleic acid that also starts at position 3027 and ends at 
position 26419, where exon 1 begins at position 3001 and 
ends at position 3160, exon 2 beings at position 9997 and 
ends at position 10144, exon 3 begins at position 14776 and 
ends at position 14847, exon 4 begins at position 14957 and 
ends at position 15029, exon 5 begins at position 15810 and 
ends at position 15906, exon 6 begins at position 21398 and 
ends at position 21478, exon 7 begins at position 21590 and 
ends at position 21731, exon 8 begins at position 21828 and 
ends at position 21964, exon 9 begins at position 25286 and 
ends at position 25379, exon 10 begins at position 25483 and 
ends at position 25548, exon 11 begins at position 25869 and 
ends at position 25964, and exon 12 begins at position 26395 


and ends at position 26616. The third transcription product 
is encoded by a nucleic acid which begins at position 14797 
and ends at position 16419, where exon 1 begins at position 
3129 and ends at position 3160, exon 2 beings at position 
9997 and ends at position 10144, exon 3 begins at position 
14776 and ends at position 14847, exon 4 begins at position 
14957 and ends at position 15029, exon 5 begins at position 
15810 and ends at position 15906, exon 6 begins at position 
21398 and ends at position 21478, exon 7 begins at position 
21590 and ends at position 21731, exon 8 begins at position 
21828 and ends at position 21964, exon 9 begins at position 
25286 and ends at position 25379, exon 10 begins at position 
25483 and ends at position 25548, exon 11 begins at position 
25869 and ends at position 25964, and exon 12 begins at 
position 26395 and ends at position 26974. 

[0012] FIGS. 2A-2HH depicts an alignment between 
nucleotide sequences reported in NCBI and Ensembl data- 
bases used to determine the nucleotide sequence of SEQ ID 

NO: 1. 

[0013] FIG. 3A shows a polypeptide sequence encoded by 
the first translation start site at position 3027 of SEQ ID 
NO:l and is a translation product of the first or second 
transcription product. The polypeptide in FIG. 3A is 87% 
identical to R. norvegicus P2X4, which is 387 amino acids 
in length (sp:P51577) and 80% identical to M. musculus 
P2X4, which is also 387 amino acids in length (ref:NP_ 
035156.1, DNA segment, Chr 5, ERATO Doi 444). FIG. 3B 
shows a polypeptide sequence encoded by the third tran- 
scription product, which begins after the first tibosome entry 
site. 

[0014] FIG. 4 depicts a tissue expression profile for P2X4. 

[0015] FIG. 5 shows a nucleotide sequence insert (SEQ 
ID NO:) that is optionally included in SEQ ID NO:l 

between positions 19137 and 19138. 

[0016] FIG. 6 shows a plot of pair wise linkage disequi- 
librium estimates D' and pairwise linkage disequifibrium 
estimates for polymorphisms identified in and around the 
F2X4 gene. Dark colored squares represent high linkage 
diseqiulibrium (LD) between SNPs, and no color or t^ht 
colored squares represent low LD between SNPs, 

DETAILED DESCRIPTION 

[0017] It has been discovered that polymorphic variants in 
or near a gene on chromosome 12 encodii^ a purinergic 
receptor are associated with fat deposition in the abdomen 
and trunk region of subjects. Individuals having increased 
fat deposition in this area are at risk of developing metabolic 
conditions (e.g., diabetes and obesity) and cardiovascular 
conditions (e.g., hypertension). Thus, methods for detecting 
genetic determinants for fat deposition can lead to early 
diagnosis of a predisposition to these conditions (e.g., hyper- 
insulinaemia, hypertension, glucose intolerance (that is, IGT 
or diabetes), dysfipidemia, hypercoagulabihty and microal- 
buminuria) and early prescription of preventative measures. 
Also, associating P2X4 with fat deposition has provided a 
new target for screening molecules useful for treatments that 
reduce fat deposition. P2X4 is also a target for screening 
molecules useful for treatii^ disorders associated with fat 
deposition, which include metabolic disorders (e.g., diabetes 
and obesity) and cardiovascular disorders (e.g., hyperten- 
sion). 
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[0018] Puriaergic receptors are polypeptides located in 
cell membranes capable of binding adenosine dipho^hate 
(ADP) and adenosine triphosphate (ATP). These receptors 
regulate biological processes including neurotransmission, 
muscle contraction, and endocrine secretion. There are two 
families of purioogenic receptors: the P2Y family of G-pro- 
tein coupled receptors the P2X family of ligand-gated cation 
channels. P2X4 is one of seven members in the P2X family 
of purinergic receptors, where activation of this receptor 
leads to opening of non-selective cation channels permeable 
to Na*, or Ca^-^ (Xlong et aL,5K /. Pharmacol, 130(6): 
1394-1398 (2000)). P2X4 cDNA has been isolated from 
human brain and mapped to chromosomal locus 12q24.32. 
This cDNA encodes a polypeptide having 87% sequence 
identity to rat P2X4. Splice variants of human P2X4 (Dbuli- 
pala et al., Gene, 207: 259-266 (1998)) and mouse P2X4 
(Townsend-Nicholoson et al.. Brain Res. Mol Brain Res., 
64: 246-254 (1999)) have been identified. 

[0019] P2X4 is expressed in neuronal and non-neuronal 
tissues, and expression was shown in the hippocampus of the 
brain and in pancreatic islets in the rat (Wang et al., 
Biochem. Biophys. Res. Commun., 220: 196-202 (1996)). 
Also, P2X4 is localized in adrenal cortical cells of the zona 
reticularis in adult rats, and it has been hypothesised that 
circadian variation and corticosteroid secretion by the adre- 
nal cortex may effect expression levels of puriaergic recep- 
tors (Afework and Burnstock, /«r. J. Devi Neuroscience, 18: 
515-520 (2000)). P2X4 has been distinguished from other 
P2X members on the basis of its relative insensitivity to 
many P2X receptor antagonists (BueU et al., EMBO J.^ 15: 
55-62 (1995); CoUo et al.. Journal of Neuroscience, 16: 
lA^S-IS^l (1996)). It was reported that ethanol inhibits 
P2X4 function in Xenopus oocytes, suggesting that P2X4 
may play a role in the action of ethanol on the central 
nervous system (Kiefer et zX.,Alc<A%ol Clin. Exp. Res., 25(5): 
787-789 (2001)). 

[0020] Central Fat Deposition and Associated Conditions 

[0021] Many individuals considered as having increased 
central fat deposition are also considered obese according to 
BMI, weight-for-height charts, or body fat measurements. 
Obesity is generally understood as a condition where fat 
content in an individual is above a predetermined level. For 
example, the National Institute of Diabetes and Digestive 
and Kidney Diseases (NIDDK) of the National Institutes of 
Health (NM, see http address www.nih.gov/health/nutrit/ 
pubs/statobes.htm) define individuals having a body mass 
index (BMI) of 25 to 29.9 kg/m^ as being overweight and 
individuals having a BMI of 30 kg/m^ or greater as beir^ 
obese. 

[0022] Increased central fat levels also have been linked to 
the metabolic syndrome, which includes the coexistence or 
one or more life threatening medical conditions such as 
metabolic conditions (e.g., diabetes and obesity) and car- 
diovascular conditions (e.g., myocardial infarction and 
hypertension). For example, cardiovascular mortaUty was 
assessed in 3,606 subjects from the Botnia study (a large- 
scale study of type 2 diabetes begun in Finland in 1990) with 
a median follow-up of 6.9 years. In women and men, 
respectively, the metaboUc syndrome was recorded in 10 and 
15% of subjects with normal glucose tolerance, 42 and 64% 
of those with IFG/IGT, and 78 and 84% of those with type 
2 diabetes. The risk for coronary heart disease and stroke 


was increased threefold in subjects with the syndrome, and 
cardiovascular mortality was markedly increased (12.0% in 
subjects with the syndrome versus 2.2% in those without; 
P<0.001) (Zimmet, et al. (2001) Nature 414: 782-787). 
Thus, determining a predisposition to fat deposition, and 
specifically central fat deposition, is useful for determining 
whether a person should be considered for being placed on 
a preventative regimen for reducing fat, thereby reducing the 
probability that the person develops one or more conditions 
linked to fat deposition. 

[0023] The term "fat deposition" as used herein refers to 
fat content in an individual as well as processes in which fat 
is deposited in certain locations of an individual. The term 
"central fat deposition" as used herein refers to fat around 
the truiLk and waist of an individual that is above a prede- 
termined level or average in a population. The central region 
may be defined as the region extending from the superior 
surface of the second lumbar vertebra extending inferiorly to 
the inferior surface of the fourth lumbar vertebra and later- 
ally to the inner aspect of the ribcage. Fat deposition can be 
measured as a quantity at one time point or a quantity over 
a series of time points, for example, and fat deposition can 
be quantified or estimated using a number of procedures 
described hereafter. Fat is composed of adipose cells depos- 
ited below the skin (i.e., subcutaneous adipose cells) and/or 
deeper within an individual's body (i.e., visceral adipose 
cells). Adipose cells often are connective tissue cells spe- 
cialized for synthesis and storage of fat. Such cells often 
contain globules of toiglycerides where the nucleus is gen- 
erally di^laced to one side of the globule and the cytoplasm 
is visuahzed as a thin line around the fat droplet. Provided 
herein are methods for detecting predisposition to overall 
adipose cell deposition in a. subject (i.e., includes subcuta- 
neous adipose cells and visceral adipose cells), as well as 
methods for distinguishing between a predisposition to 
subcutaneous adipose cell deposition and a predisposition to 
visceral adipose cell deposition. 

[0024] Fat deposition is quantified in a number of marmers 
(see, e.g., Wajchenberg, Endocrine Rev. 21(6): 697-738 
(2000)). For example, caliper measurements of skinfold 
thickness in defined areas of the body have been utilized to 
differ between different kinds of regional fat (Nordhamn, et 
aUInt. J. Obes. Relat. Metab. Disord 24(5): 652-7 (2000)). 
Waist and hip measurements using tape measures are com- 
monly utilized indices of central fat (Lundgren et al., Int. J. 
Obes., 13(4): 413-23 (1989); Ohlson et al.. Diabetes 34(10): 
1055-8 (1985)), and sagittal abdominal diameter is mea- 
sured by some researchers for quantifying central fat. Also, 
computed tomography and X-ray based methods have been 
utilized to quantify central fat content. Dual x-ray absorb- 
tiometry (DEXA) is relatively fast and inexpensive and 
yields reliable estimations of body composition (fat mass/ 
lean mass/bone) with reproducibility. DEXA measurements 
and waist and hip measurements were utilized for quanti- 
fying central fat in Example 1. Magnetic resonance imaging 
(MRI) and computed tomography procedures can be used to 
distinguish between visceral fat deposition and subcutane- 
ous fat deposition (see e.g. Wajchenberg, supra). 

[0025] Thus, fat deposition can be expressed in units used 
for quantifying fat content. Fat deposition can be expressed 
in terms of total fat content in an individual or region of an 
individual (grams or percentage of total weight of an indi- 
vidual), visceral fat content in an individual or region of an 
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individual (grams, percentage of total weight of an indi- 
vidual, or percentage of total fat in an individual), and 

subcutatnous fat content in an individual or region of an 
individual (grams, percentage of total weight of an indi- 
vidual, or percentage of total fat in an individual). Each of 
these expressions of fat deposition can be measured or 
quantified at a single point in time or over two or more points 
in time. 

[0026] Fat deposition also can be expressed in terms of 
"increased fat deposition" (also referred to as "higher fat 
deposition" and "at increased risk for fat deposition"), which 
is relative to average fat deposition in a population. In a 
distribution of fat deposition across a population (expressed 
in any of the units of measure described herein), individuals 
having increased fat deposition are sometimes represented in 
the upper 40% or upper 30% of the population, often in the 
upper 25%, upper 20%, upper 15%, and upper 10% of the 
population, and sometimes in the upper 5% of the popula- 
tion. Also, individuals having increased fat deposition can be 
characterized as having waist/hip ratios of 1.01 or more for 
males and 0.91 or more for females. In addition, men or 
women having a BMI between 25 and 30 or between about 
1335 and about 2050 grams of central fat are typically 
considered overweight, and individuals having a BMI over 
30 or over about 2050 grams of central fat are normally 
considered obese (e.g., grams of central fat can be deter- 
mined by DEXA, as descibed above). Also, "leanness" or 
"decreased fat deposition" (also referred to as "lower fat 
deposition" and "at decreased risk for fat deposition") are 
terms that refer to fat deposition and are also relative to 
average fat deposition in a population. In a distribution of fat 
deposition across a population, lean individuals are some- 
times represented in the lower 40% or lower 30% of the 
population, often in the lower 25%, lower 20%, lower 15%, 
and lower 10% of the population, and sometimes in the 
lower 5% of the population. Also, lean individuals can be 
characterized as having waist/hip ratios of 1.00 or less for 
males and 0.90 or less for females. In addition, men or 
women having a BMI of 24 or less or less than about 1334 
grams of central fat are normally considered lean. 

[0027] The term "metabolic condition" as used herein 
refers to a disease, disorder, or state involving increased or 
decreased metabolites relative to a population average. 
Examples of metabolic disorders include but are not limited 
to diabetes, obesity, anorexia nervosa, cachexia, and lipid 
disorders. The new paradigm relating to central obesity and 
type 2 diabetes also influences contemporary therapy for the 
disease. Evidence now exists for a far more aggressive 
aj^roach to treating not just the hjrperglycaemia, but also 
other cardiovascular disease risk factors such as hyperten- 
sion, dyslipidaemia and central obesity in type 2 diabetic 
patients, with the hope of significantly reducing cardiovas- 
cular morbidity and mortality. 

[0028] The term "NIDDM" as used herein refers to non- 
insulin-dependent diabetes melUtus or Type 2 diabetes (the 
two terms are used interchangeably throughout this docu- 
ment). NIDDM refers to an insuHn-related disorder in which 
there is a relative disparity between endogenous insuHn 
production and insulin requirements, leading to elevated 
hepatic glucose production, elevated blood glucose levels, 
inappropriate insulin secretion, and peripheral insulin resis- 
tance. 


[0029] The term "cardiovascular condition" as used herein 
refers to a disease, disorder, or state involving the cardio- 
vascular system, e.g., the heart, the blood vessels, and/or the 
blood. A cardiovascular disorder can be caused by an 
imbalance in arterial pressure, a malfunction of the heart, or 
an occlusion of a blood vessel (e.g., by a thrombus). Other 
examples of cardiovascular disorders include but are not 
limited to hypertensioa, atherosclerosis, coronary artery 
spasm, coronary artery disease, arrhythmias, heart failure, 
including but not limited to, cardiac hjrpertrophy, left-sided 
heart failure, and right-sided heart failure; ischemic heart 
disease, including but not limited to angina pectoris, myo- 
cardial infarction, chronic ischemic heart disease, and sud- 
den cardiac death; hypertensive heart disease, includii^ but 
not limited to, systemic (left-sided) hypertensive heart dis- 
ease and pulmonary (right-sided) hypertensive heart disease; 
valvular heart disease, including but not limited to, valvular 
degeneration caused by calcification, such as calcification of 
a congenitally bicuspid aortic valve, and mitral annular 
calcification, and myxomatous degeneration of the mitral 
valve (mitral valve prolapse), rheumatic fever and rheumatic 
heart disease, infective endocarditis, and noninfected veg- 
etations, such as nonbacterial thrombotic endocarditis and 
endocarditis of systemic lupus erythematosus (Libman- 
Sacks disease), carcinoid heart disease, and complications of 
artificial valves; myocardial disease, including but not lim- 
ited to dilated cardiomyopathy, hypertrophic cardiomyopa- 
thy, restrictive cardiomyopathy, and myocarditis; pericardial 
disease, including but not limited to, pericardial effusion and 
hemopericardium and pericarditis, including acute peri- 
carditis and healed pericarditis, and rheumatoid heart dis- 
ease; neoplastic heart disease, including but not limited to, 
primary cardiac tumors, such as myxoma, Upoma, papillary 
fibroelastoma, rhabdomyoma, and sarcoma, and cardiac 
effects of noncardiac neoplasms; congenital heart disease, 
including but not limited to, left-to -right shunts (late cyano- 
sis, such as atrial septal defect, ventricular septal defect, 
patent ductus arteriosus, and atrioventricular septal defect, 
right-to-left shunts), early cyanosis (e.g., tetralogy of fallot, 
transposition of great arteries, truncus arteriosus, tricuspid 
atresia, and total anomalous pulmonary venous connection), 
obstructive congenital anomalies (e.g., coarctation of aorta, 
pulmonary stenosis and atresia, and aortic stenosis and 
atresia), disorders involving cardiac transplantation, and 
congestive heart failure. 

[0030] Polymorphic Variants Associated with Fat Deposi- 
tion 

[0031] A genetic analysis provided herein linked fat depo- 
sition with polymorphic variants of a nucleotide sequence 
located on chromosome twelve that encodes a purinergic 
receptor designated P2X4. An additional genetic analysis 
provided herein associated NIDDM with genetic variability 
of the P2X4 nucleotide sequence. As used herein, the term 
"polymorphic site" refers to a region in a nucleic acid at 
which two or more altemative nucleotide sequences are 
observed in a significant number of nucleic acid samples 
from a population of individuals. A polymorphic site may be 
a nucleotide sequence of two or more nucleotides, an 
inserted nucleotide or nucleotide sequence, a deleted nucle- 
otide or nucleotide sequence, or a microsatelKte, for 
example. A polymorphic site that is two or more nucleotides 
in length may be 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or 
more, 20 or more, 30 or more, 50 or more, 75 or more, 100 
or more, 500 or more, or about 1000 nucleotides in length. 


us 2004/0018533 Al 


5 


Jan. 29, 2004 


where all or some of the nucleotide sequences differ within 
the region. A polymorphic site often is one nucleotide in 
length, which is referred to herein as a "single nucleotide 
polymorphism" or a "SNR" 

[0032] Where there are two, three, or four alternative 
nucleotide sequences at a polymorphic site, each nucleotide 
sequence is referred to as a "polymorphic variant." Where 
two polymorphic variants exist, for example, the poljnuor- 
phic variant represented in a minority of samples from a 
population is sometimes referred to as a "minor allele" and 
the polymorphic variant that is more prevalently represented 
is sometimes referred to as a "major allele/' Many organ- 
isms possess a copy of each chromosome (e.g., humans), 
and those individuals who possess two major alleles or two 
minor alleles often are referred to as being "homozygous" 
with respect to the polymorphism and those individuals who 
possess one major allele and one minor allele are normally 
referred to as being "heterozygous" with respect to the 
polymorphism. Individuals who are homozygous with 
respect to one allele are sometimes predisposed to a different 
phenotype as compared to individuals who are heterozygous 
or homozygous with respect to another allele. As used 
herein, the term "phenotype*' refers to a trait which can be 
compared between indviduals, such as presence or absence 
of a condition, a visually observable difference in appear- 
ance between individuals, metabolic variations, physiologi- 
cal variations, variations in the function of biological mol- 
ecules, and the like. Examples of phenotypes are fat 
deposition, obesity, and diabetes. 

[0033] Researchers sometimes report a polymorphic vari- 
ant in a database without determining whether the variant is 
represented in a significant fraction of a population. Because 
a subset of these reported polymorphmic variants are not 
represented in a statistically significant portion of the popu- 
lation, some of them are sequencing errors and/or not 
biologically relevant. Thus, it often is not known whether a 
reported polymorphic variant is statistically significant or 
biologically relevant until the presence of the variant is 
detected in a population of individuals and the frequency of 
the variant is determined. Methods for detecting a polymor- 
phic variant in a population is described herein, specificaUy 
in Example 2. A polymorphic variant is statistically signifi- 
cant and often biologically relevant if it is represented in 5% 
or more of a population, sometimes 10% or more, 15% or 
more, or 20% or more of a population, and often 25%, 30% 
or more, 35% or more, 40% or more, 45% or more, or 50% 
or more of a population. 

[0034] A polymorphic variant may be detected on either or 
both strands of a double-stranded nucleic acid. Also, a 
polymorphic variant may be located within an intron or exon 
of a gene or within a portion of a regulatory region such as 
a promoter, a 5' untranslated region (UTR), a 3 UTR, and in 
DNA (e.g., genomic DNA (gDNA) and complementary 
DNA(cDNA)), RNA(e.g., mRNA, tRNA, and rRNA), or a 
polypeptide. Polymorphic variations may or may not result 
in detectable differences in gene expression, polypeptide 
structure, or polypeptide function. 

[0035] In the genetic analysis that associated polymorphic 
variations in P2X4 with fat deposition and NIDDM, samples 
from individuals in a population of twin pairs were geno- 
typed, although other populations could be subjected to 
analysis. The term "genotyped" as used herein refers to a 


process for determining a genotype of one or more indi- 
viduals, where a "genotype" is a representation of polymor- 
phic variants in a population. Fat deposition was quantified 
in the central region of individuals in the study group, and 
SNPs were identified at positions 11030, 15847, and 17338 
in the P2X4 nucleotide sequence represented by SEQ ID 
NO: 1. It was determined that 33% of the individuals tested 
in the genetic analysis had a thymine at position 11030 and 
64% of the indivduals had an adenine at this position. At 
position 15847, 55% of the individuals had a thymine and 
45% of the individuals had a cytosine, and at position 17338, 
76% of the individuals had a cytosine and 24% of the 
individuals had a thymine. It was determined that an adenine 
at position 11030, a thymine at position 15847, or a cytosine 
at position 17338 of SEQ ID NO:l were individually 
associated with central fat deposition, and the presence of a 
thymine at position 11030, a cytosine at position 15847, or 
a thymine at position 17338 of SEQ ID NO:l were indi- 
vidually associated with leanness. 

[0036] Furthermore, a genotype or polymorphic variant 
may be expressed in terms of a "hap lo type," which as used 
herein refers to two or more polymorphic variants occurring 
within genomic DNA in a group of individuals within a 
population. For example, two SNPs may exist within a gene 
where each SNP position includes a cytosine variation and 
an adenine variation. Certain individuals in a population 
may carry one allele (heterozygous) or two alleles (homozy- 
gous) having the gene with a cytosine at each SNP position. 
As the two cytosines corresponding to each SNP in the gene 
travel together on one or both alleles in these individuals, the 
individuals can be characterized as having a cytosine/cy- 
tosine haplotype with respect to the two SNPs in the gene. 

[0037] The genetic analysis identified haplotypes associ- 
ated with lower risk of fat deposition. In particular, presence 
of a haplotype represented by TTAT or TCAT at positions 
11030, 15847, 21708, and 22713, respectively, in the P2X4 
sequence represented by SEQ ID NO: 1 were associated 
with low central fat deposition. As used herein, a "haplo- 
type" refers to a combination of polymorphic variations in a 
defined region within a genetic locus on one of the chro- 
mosomes in a chromosome pair. 

[0038] In the genetic analysis that associated a polymor- 
phic variation in P2X4 with NIDDM, samples from indi- 
viduals in a population of diabetics and non-diabetics were 
genotyped. NIDDM can be diagnosed in patients by mea- 
suring fasting plasma glucose levels and/or casual plasma 
glucose levels, measuring fasting plasma insulin levels 
and/or casual plasma insulin levels, or administering oral 
glucose tolerance tests or hyperinsulinemic euglycemic 
clamp tests. A SNP at position 15847 in the P2X4 nucleotide 
sequence represented by SEQ ID NO: 1 was genotyped in the 
diabetes case and control populations. It was determined that 
33% of the individuals tested in the genetic analysis had a 
cytosine at position 15847 and 64% of the indivduals had a 
thymine at this position. It was determined that in females a 
thymine at position 15847 of SEQ ID NO: 1 was individually 
associated with NIDDM, and in females the presence of a 
cytosine at position 15847 was individually associated with 
a not having NIDDM. 

[0039] Additional Polymorphic Variants Associated with 

Fat Deposition 

[0040] Also provided is a method for identifying poly- 
morphic variants proximal to an incident, founder polymor- 
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phic variant associated with fat deposition, obesity and 
NIDDM. Thus, featured herein are methods for identifying 
a polymorphic variation associated with fat deposition or 
NIDDM that is proximal to an incident polymorphic varia- 
tion associated with fat deposition or NIDDM, which com- 
prises identifying a polymorphic variant proximal to the 
incident polymorphic variant associated with fat deposition 
of NIDDM, where the incident polymorphic variant is in a 
P2X4 nucleotide sequence. The P2X4 nucleotide sequence 
often comprises a polynucleotide sequence selected from the 
group consisting of (a) a polynucleotide sequence set forth 
in SEQ ID NO: 1; (b) a polynucleotide sequence that 
encodes a polypeptide having an amino acid sequence 
encoded by a nucleotide sequence set forth as SEQ ID NO: 
1; or (c) a pol3mucleotide sequence that encodes a polypep- 
tide having an amino acid sequence that is 90% identical to 
an amino acid sequence encoded by a nucleotide sequence 
set forth in SEQ ID NO: 1 or a polynucleotide sequence 90% 
identical to the polynucleotide sequence of SEQ ID NO:l. 
The presence or absence of an association of the proximal 
polymorphic variant with fat deposition or NIDDM then is 
determined using a known association method, such as a 
method described in the Examples hereafter. In an embodi- 
ment, the incident polymorphic variant is at position 11030, 
15847, or 17338 of SEQ ID NO: 1. In another embodiment, 
the proximal polymorphic viiriant identified sometimes is a 
publicly disclosed pol3niiorphic variant, which for example, 
sometimes is published in a publicly available database. In 
other embodiments, the polymorphic variant identified is not 
publicly disclosed and is discovered using a known method, 
including, but not Umited to, sequencing a region surround- 
ing the incident polymorphic variant in a group of nucleic 
samples. Thus, multiple polymorphic variants proximal to 
an incident polymorphic variant are associated with fat 
deposition and NIDDM using this method. 

[0041] The proximal polymorphic variant often is identi- 
fied in a region surrounding the incident polymorphic vari- 
ant. In certain embodiments, this surrounding region is about 
50 kb flanking the first polymorphic variant (e .g. about 50 kb 
5' of the first polymorphic variant and about 50 kb 3' of the 
first polymorphic variant), and the region sometimes is 
composed of shorter flanking sequences, such as flanking 
sequences of about 40 kb, about 30 kb, about 25 kb, about 
20 kb, about 15 kb, about 10 kb, about 7 kb, about 5 kb, or 
about 2 kb 5' and 3' of the incident polymorphic variant. In 
other embodiments, the region is composed of longer flank- 
ing sequences, such as flanking sequences of about 55 kb, 
about 60 kb, about 65 kb, about 70 kb, about 75 kb, about 
80 kb, about 85 kb, about 90 kb, about 95 kb, or about 100 
kb 5' and 3' of the incident polymorphic variant. 

[0042] In certain embodiments, polymorphic variants 
associated with fat deposition or NIDDM are identified 
iteratively. For example, a first proximal polymorphic vari- 
ant is associated with fat deposition using the methods 
described above and then another polymorphic variant 
proximal to the first proximal polymorphic variant is iden- 
tified (e.g., publicly disclosed or discovered) and the pres- 
ence or absence of an association of one or more other 
polymorphic variants proximal to the first proximal poly- 
morphic variant with fat d^osition or NIDDM is deter- 
mined. 

[0043] The methods described herein are useful for iden- 
tifying or discovering additional polymorphic variants that 


may be used to further characterize a gene, region or loci 
associated with a condition, a disease (e.g., fat deposition or 
NIDDM), or a disorder. For example, allelotyping or geno- 
typing data from the additional polymorphic variants may be 
used to identify a functional mutation or a region of linkage 
disequfiibrium. 

[0044] In certain embodiments, polymorphic variants 
identified or discovered within a region comprising the first 
polymorphic variant associated with fat deposition or 
NIDDM are genotjrped using the genetic methods and 
sample selection techniques described herein (e.g.. Example 
6), and it can be determined whether those polymorphic 
variants are in Unkage disequilibrium with the first poly- 
morphic variant. The size of the region in hnkage disequi- 
librium with the first polymorphic variant also can be 
assessed using these genotyping methods, and sizes of the 
regions surrounding an incident SNP described above often 
are probed for this determination. Thus, provided herein are 
methods for determining whether a proximal polymorphic 
variant is in linkage disequilibrium with an incident poly- 
morphic variant associated with fat deposition or NIDDM, 
and such information is useful for the prognosis methods 
described herein. Example 6 and FIG. 6 demonstrate that 
polymorphic variants in a region spanning from about 
position 11000 to about position 25300 in SEQ ID NO:l or 
a substantially identical nucleic acid are in linkage disequi- 
librium. Polymorphic variants occur at positions 11030, 
14744, 14977, 15046, 15059, 15847, 17338, 21538, 21630, 
21708, 22713, and 25264, and other polymorphic variants in 
linkage disequilibrium may occur within the region or 
outside the region. Polymorphic variants occurring outside 
of the region often are within about 7 kb of position 11000 
and position 25300, and sometimes within about 6 kb, about 
5 kb, about 4 kb, about 3 kb, about 2 kb and about 1 kb of 
these positions. 

[0045] Isolated P2X4 Nucleic Acids and Variants Thereof 

[0046] Feartured herein are isolated P2X4 nucleic adds, 
which include the nucleic acid having the nucleotide 
sequence of SEQ ID NO:l, P2X4 nucleic acid variants, and 
substantially identical nucleic acids to the foregoing. Nucle- 
otide sequences of the P2X4 nucleic acids are sometimes 
referred to herein as "P2X4 nucleotide sequences." A" P2X4 
nucleic acid variant" refers to one aUele that may have 
different polymorphic variations as compared to another 
allele in another subject or the same subject. A polymorphic 
variation in the P2X4 nucleic acid variant may be repre- 
sented on one or both strands in a double-stranded nucleic 
acid or on one chromosomal complement (heterozygous) or 
both chromosomal complements (homozygous)). In certain 
embodiments, the nucleic acid comprises a P2X4 nucleotide 
sequence selected from the group consisting of: (a) the 
nucleotide sequence of SEQ ID NO:l; (b) a nucleotide 
sequence which encodes a polypeptide consisting of the 
amino add sequence of SEQ ID NO:2 or SEQ ID N0:3; (c) 
a nucleotide sequence which encodes a polypeptide that is 
90% identical to the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO: 3 or a nucleotide sequence that is 90% 
identical to the nucleotide sequence of SEQ ID NO:l; (d) a 
fragment of a nucleotide sequence of (a), (b), or (c) com- 
prising a polymorphic variation; and (e) a nucleotide 
sequence complementary to the nucleotide sequences of (a), 
(b), (c), or (d); where the P2X4 nucleotide sequence often 
comprises one or more polymorphic variations selected from 
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the group consisting of a thymine at position 14744, an 
adenine at position 14977, a cytosine at position 15046, a 
thymine at position 21538, a guanine at position 21630, and 
a single nucleotide deletion at position 25264 of SEQ ID 
NO:l, and optionally, a GGGCCCC nucleic acid insert 
between positions 2878 and 2891 in SEQ ID NO:l. The 
P2X4 polynucleotide sequence also sometimes comprises 
one or more polymorphic variations selected from the group 
consisting of an adenine at position 1075, an adenine at 
position 2383, a guanine at position 2494, a single nucle- 
otide insert at position 2878, a thymine at position 3191, a 
guanine at position 25346, a cytosine at position 25798, a 
guanine at position 25864, an adenine at position 26334, a 
thymine at position 26343, and a guanine at position 27043 
of SEQ ID NO:l. In specific embodiments, a P2X4 nucleic 
acid comprises one or more of the following polymorphic 
variations: an adenine or thymine at position 11030 of SEQ 
ID NO:l in a strand, or a guanine or adenine in a comple- 
mentary strand; a cytosine or thymine at position 15847 of 
SEQ ID NO:l in a strand, or a guanine or adenine in a 
complementary strand; a cytosine or thymine at position 
17338 of SEQ ID NO:l in a strand, or a guanine or adenine 
in a complementary strand; presence of TTAT, TCAT, ACAT, 
or ACGC at positions 11030, 15847, 21708, and 22713 of 
SEQ ID NO:l, respectively, in a strand, or presence of 
AATA, ACTA, TGTA, or TGCG in a complementary strand; 
and presence or absence of a GGGCCCC insert between 
positions 2878 and 2891 of SEQ ID NO:l in a strand, or 
presence or absence of a GGGGCCC insert in a comple- 
mentary strand. 

[0047] As used herein, the term "nucleic acid" includes 
DNA molecules (e.g., a complementary DNA (cDNA) and 
genomic DNA(gDNA)) and RNA molecules (e.g., mRNA, 
rRNA, siRNAand tRNA) and analogs of DNA or RNA, for 
example, by use of nucleotide analogs. The nucleic acid 
molecule can be single-stranded and it often is double- 
stranded. The term "isolated or purified nucleic acid" refers 
to nucleic acids that are separated from other nucleic acids 
present in the natural source of the nucleic acid. For 
example, with regard to genomic DNA, the term "isolated" 
includes nucleic acids which are separated from the chro- 
mosome with which the genomic DNA is naturally associ- 
ated. An "isolated" nucleic acid often is free of sequences 
which naturally flank the nucleic acid (i.e., sequences 
located at the 5' and/or 3' ends of the nucleic acid) in the 
genomic DNA of the organism fi-om which the nucleic acid 
is derived. For example, in various embodiments, the iso- 
lated nucleic acid molecule can contain less than about 5 kb, 
4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' 
nucleotide sequences which flank the nucleic acid molecule 
in genomic DNA of the cell from which the nucleic acid is 
derived. Moreover, an "isolated" nucleic acid molecule, 
such as a cDNA molecule, can be substantially free of other 
cellular material, or culture medium when produced by 
recombinant techniques, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. 
As used herein, the term "P2X4 gene" refers to a nucleotide 
sequence that encodes a P2X4 polypeptide. 
[0048] Also included herein are nucleic acid fragments. 
These fragments are typically a nucleotide sequence iden- 
tical to a nucleotide sequence in SEQ ID NO: 1, a nucleotide 
sequence substantially identical to a nucleotide sequence in 
SEQ ID NO:l, or a nucleotide sequence that is complemen- 
tary to the foregoing. The nucleic acid fragment may be 


identical, substantially identical or homolgous to a nucle- 
otide sequence in an exon or an intron in SEQ ID NO : 1 and 
may encode a domain or part of a domain of a P2X4 
polypeptide. Sometimes, the fragment comprises one or 
more of the polymorphic variations described herein as 
being associated with increased fat deposition or increased 
risk of developing NIDDM. The nucleic acid fragment often 
is 50, 100, or 200 or fewer base pairs in length, and 
sometimes is about 300, 400, 500, 600, 700, 800, 900, 1000, 
1100, 1200, 1300, 1400, 1500, 2000, 3000, 4000, 5000, 
10000, 15000, or 20000 base pairs in length. A nucleic acid 
fragment that is complementary to a nucleotide sequence 
identical or substantially identical to the nucleotide sequence 
of SEQ ID NO:l and hybridizes to such a nucleotide 
sequence under stringent conditions often is referred to as a 
"probe." Nucleic acid fragments often include one or more 
polymorphic sites, or sometimes have an end that is adjacent 
to a polymorphic site as described hereafter. 

[0049] An example of a nucleic acid fragment is an 
ohgonucleotide. As used herein, the term 'oligonucleotide" 
refers to a nucleic acid comprising about 8 to about 50 
covalently linked nucleotides, often comprising from about 
8 to about 35 nucleotides, and more often from about 10 to 
about 25 nucleotides. The backbone and nucleotides within 
an oKgpnucleotide may be the same as those of naturally 
occurring nucleic acids, or analogs or derivatives of natu- 
rally occurring nucleic acids, provided that oligonucleotides 
having such analogs or derivatives retain the ability to 
hybridize specifically to a nucleic acid comprising a targeted 
polymorphism. Oligonucleotides described hereafter are 
used as hybridization probes or as components of diagnostic 
assays, for example, as described herein. 

[0050] Oligonucleotides typically are synthesized using 
standard methods and equipment, such as the ABI™3900 
High Throughput DNA Synthesizer and the EXPEDITE™ 

8909 Nucleic Acid Synthesizer, both of which are available 
from Applied Biosystems (Foster City, Calif.). Analogs and 
derivatives are exemplified in U.S. Pat. Nos. 4,469,863; 
5,536,821; 5,541,306; 5,637,683; 5,637,684; 5,700,922; 
5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 
5,929,226; 5,977,296; 6,140,482; WO 00/56746; WO 
01/14398, and related publications. Methods for synthesiz- 
ing oligonucleotides comprising such analogs or derivatives 
are disclosed, for example, in the patent publications cited 
above and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955, 
599; 5,962,674; 6,117,992; in WO 00/75372; and in related 
publications. 

[0051] Oligonucleotides also may be linked to a second 
moiety. The second moiety may be an additional nucleotide 
sequence such as a tail sequence (e.g., a poly adenosine tail), 
an adaptor sequence (e.g., phage M13 universal tail 
sequence), and others. Alternatively, the second moiety may 
be a non-nucleotide moiety such as a moiety which facili- 
tates linkage to a soUd support or a label to facilitate 
detection of the oligonucleotide. Such labels include, with- 
out limitation, a radioactive label, a fluorescent label, a 
chemiluaunescent label, a paramagnetic label, and the like. 
The second moiety may be attached to any position of the 
oligonucleotide, provided the oligonucleotide can hybridize 
to the nucleic acid comprising the polymorphism. 
[0052] Uses for Nucleic Acids 

[0053] Nucleic acid coding sequences depicted in SEQ ID 
NO:l, or substantially identical sequences thereof, may be 
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used for diagnostic purposes for detection and control of 
polypeptide expression. Also, included herein are oligo- 
nucleotide sequences such as antisense RNA, smaU-inter- 
fering RNA (siRNA) and DNA molecules and ribozymes 
that function to inhibit translation of a polypeptide. Anti- 
sense techniques and RNA interference techniques are 
known in the art and are described herein. 

[0054] Ribozymes are enzymatic RNA molecules capable 
of catalyzing the specific cleavage of RNA. The mechanism 
of ribozyme action involves sequence specific hybridization 
of the ribozyme molecule to complementary target RNA, 
followed by a endonucleolytic cleavage. Ribozymes may be 
engineered hammerhead motif ribozyme molecules that 
specifically and eflEciently catalyze endonucleoljrtic cleav- 
age of RNA sequences corresponding to or complementary 
to the nucleotide sequences set forth in SEQ ID NO: 1. 
Specific ribozyme cleavage sites vsdthin any potential RNA 
target are initially identified by scanning the target molecule 
for ribozyme cleavage sites which include the following 
sequences, GUA, GUU and GUC. Once identified, short 
RNA sequences of between fifteen (15) and twenty (20) 
ribonucleotides corresponding to the region of the target 
gene containing the cleavage site may be evaluated for 
predicted structural features such as secondary structure that 
may render the oligonucleotide sequence unsuitable. The 
suitability of candidate targets may also be evaluated by 
testing their accessibility to hybridization with complemen- 
tary oligonucleotides, using ribonuclease protection assays. 

[0055] Antisense RNA and DNA molecules, siRNA and 
ribozymes are prepared by any method known in the art for 
the synthesis of nucleic acid molecules. These include 

known techniques for chemicaUy synthesizing oligonucle- 
otides such as solid phase phosphoramidite chemical syn- 
thesis. RNA molecules sometimes are generated by in vitro 
and in vivo transcription of DNA sequences encoding the 
antisense RNA molecule. DNA sequences sometimes are 
incorporated into a vector which include a suitable RNA 
polymerase promoter such as the T7 or SP6 polymerase 
promoters. Also, antisense cDNA constructs that synthesize 
antisense RNA constitutively or inducibly, depending on the 
promoter used, sometimes are introduced into cell lines by 
stable transfection. 

[0056] DNA encoding a polypeptide also is utilized in a 
number of methods, which include diagnosis of conditions 
or diseases, such as fat deposition or NIDDM, resulting from 
aberrant expression of the purinergic receptor. For example, 
the nucleic acid sequence may be used in hybridization 
assays of biopsies or autopsies to diagnose abnormalities of 
expression or function (e.g., Southern or Northern blot 
analysis, in situ hybridization assays). 

[0057] In addition, the expression of a polypeptide during 
embryonic development may also be determined using 
nucleic acid encoding the polypeptide. As addressed, infra, 
production of functionally impaired polypeptide is the cause 
of various disease states, including fat deposition or 
NIDDM. In situ hybridizations using polypeptide as a probe 
may be employed to predict problems related to obesity or 
NIDDM. Further, as indicated, infra, administration of 
human active polypeptide, recombinandy produced as 
described herein, may be used to treat disease states related 
to functionally unpaired polypeptide. Alternatively, gene 


therapy approaches may be employed to remedy deficiencies 
of functional polypeptide or to replace or compete with 
dysfunctional polypeptide. 

[0058] Expression \fectors. Host Cells, and Genetically 
Engineered Cells 

[0059] Provided herein are nucleic acid vectors, often 
expression vectors, which contain a P2X4 nucleic acid. As 
used herein, the term "vector" refers to a nucleic acid 
molecule capable of transporting another nucleic acid to 
which it has been linked and can include a plasmid, cosmid, 
or viral vector. The vector can be capable of autonomous 
replication or it can integrate into a host DNA. Viral vectors 
may include replication defective rettoviruseSi, adenoviruses 
and adeno-associated viruses for example. 

[0060] A vector can include a P2X4 nucleic acid in a form 
suitable for expression of the nucleic acid in a host cell. The 
recombinant expression vector typically includes one or 
more regulatory sequences operatively linked to the nucleic 
acid sequence to be expressed. The term "regulatory 
sequence" includes promoters, enhancers and other expres- 
sion control elements (e.g., polyadenylation signals). Regu- 
latory sequences include those that direct constitutive 
expression of a nucleotide sequence, as well as tissue- 
specific regulatory and/or inducible sequences. The design 
of the expression vector can depend on such factors as the 
choice of the host cell to be transformed, the level of 
expression of polypeptide desired, and the Uke. Expression 
vectors can be introduced into host cells to produce P2X4 
polypeptides, including fusion polypeptides, encoded by 
P2X4 nucleic acids. 

[0061] Recombinant expression vectors can be designed 
for expression of P2X4 polypeptides in prokaryotic or 
eukaryotic cells. For example, P2X4 polypeptides can be 
expressed in E. coli, insect cells (e.g., using baculovirus 
expression vectors), yeast cells, or mammalian cells. Suit- 
able host cells are discussed further in Goeddel, Gene 
Expression Technology: Methods in Enzymology 185, Aca- 
demic Press, San Diego, Calif. (1990). Alternatively, the 
recombinant expression vector can be transcribed and trans- 
lated in vitro, for example using T7 promoter regulatory 
sequences and T7 polymerase. 

[0062] Expression of polypeptides in prokaryotes is most 
often carried out in E. coli with vectors containing consti- 
tutive or inducible promoters directing the expression of 
either fusion or non-fusion polypeptides. Fusion vectors add 
a number of amino acids to a polypeptide encoded therein, 
usually to the amino terminus of the recombinant polypep- 
tide. Such fusion vectors typically serve three purposes: 1) 
to increase expression of recombinant polypeptide; 2) to 
increase the solubihty of the recombinant polypeptide; and 
3) to aid in the purification of the recombinant polypeptide 
by acting as a Hgand in affinity purification. Often, a 
proteoljrtic cleavage site is introduced at the junction of the 
fusion moiety and the recombinant polypeptide to enable 
separation of the recombinant polypeptide from the fusion 
moiety subsequent to purification of the fusion polypeptide. 
Such enzymes, and their cognate recognition sequences, 
include Factor Xa, thrombin and enterokinase. Typical 
fusion expression vectors include pGEX (Pharmacia Bio- 
tech Inc; Smith, D. B. and Johnson, K. S., Gene 67: 31-40 
(1988)), pMAL(New England Biolabs, Beverly, Mass.) and 
pRIT5 (Pharmacia, Piscataway, N.J.) which fuse glutathione 
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S-transferase (GST^, maltose E binding polypeptide, or 
polypeptide A, respectively, to the target recombinant 
polypeptide. 

[0063] Purified fusion polypeptides can be used in screen- 
ing assays and to generate antibodies specific for P2X4 
polypeptides. In a therapeutic embodiment, fusioo polypep- 
tide expressed in a retroviral expression vector is used to 
infect bone marrow cells that are subsequently transplanted 
into irradiated recipients. The pathology of the subject 
recipient is then examined after sufficient time has passed 
(e.g., six (6) weeks). 

[0064] Expressing the polypeptide in host bacteria with an 
impaired capacity to proteolytically cleave the recombinant 
polypeptide often is used to maximize recombinant polypep- 
tide expression (Gottesman, S., Gene Eyq?ression Technol- 
ogy: Methods in Enzymology, Academic Press, San Diego, 
Calif. 185: 119-128 (1990)). Another strategy is to alter the 
nucleotide sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each 
amino acid are those preferentially utilized in E. coll (Wada 
et al.. Nucleic Acids Res. 20: 2111-2118 (1992)). Such 
alteration of nucleotide sequences can be carried out by 
standard DNA synthesis techniques. 

[0065] When used in mammaUan cells, the expression 
vector's control functions often are provided by viral regu- 
latory elements. For example, commonly used promoters are 
derived from polyoma. Adenovirus 2, C5rtomegalovirus and 
Simiiui Virus 40. Recombinant mammalian expression vec- 
tors often are capable of directing expression of the nucleic 
acid in a particular cell type (e.g., tissue-specific regulatory 
elements are used to express the nucleic acid). Non-limiting 
examples of suitable tissue-specific promoters include an 
albumin promoter (liver-specific; Pinkert et al., Genes Dev. 
1: 268-277 (1987)), lymphoid-speciflc promoters (Calame 
and Eaton, A(^v. Immunol. 43: 235-275 (1988)), promoters of 
T cell receptors (Winoto and Baltimore, EMBO J. 8: 729- 
733 (1989)) promoters of immunoglobulins (Banerji et al., 
Cell 33: 729-740 (1983); Queen and Baltimore, Cell 33: 
741-748 (1983)), neuron-specific promoters (e.g., the neu- 
rofilament promoter; Byrne and Ruddle, Proc. Natl. Acad. 
Set. USA 86: 5473-5477 (1989)), pancreas-specific promot- 
ers (Edlund et al., Science 230: 912-916 (1985)), and mam- 
mary gland-specific promoters (e.g., milk whey promoter; 
U.S. Pat. No. 4,873,316 and European Application Publica- 
tion No. 264,166). Developmentally-regulated promoters 
are sometimes utilized, for example, the murine hox pro- 
moters (Kessel and Gnxss, Science 249: 374-379 (1990)) and 
the ct-fetopolypeptide promoter (Campes and Tilghman, 
Genes Dev. 3: 537-546 (1989)). 

[0066] A P2X4 nucleic acid may also be cloned into an 

expression vector in an antisense orientation. Regulatory 
sequences (e.g., viral promoters and/or enhancers) opera- 
tively linked to a P2X4 nucleic acid cloned in the antisense 
orientation can be chosen for directum constitutive, tissue 
specific or cell type specific expression of antisense RNAin 
a variety of cell types. Antisense expression vectors can be 
in the form of a recombinant plasmid, phagemid or attenu- 
ated virus. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub, H., et al., 
Antisense RNA as a molecular tool for genetic analysis. 
Reviews — Trends in Genetics, Vol. 1(1) (1986), 
[0067] Also provided herein are host cells that include a 
P2X4 nucleic acid within a recombinant expression vector 


or P2X4 nucleic acid sequence fragments which allow it to 
homologously recombine into a specific site of the host cell 
genome. The terms "host cell" and "recombinant host cell" 
are used interchangeably herein. Such terms refer not only 
to the particular subject cell but rather also to the progeny or 
potential progeny of such a cell. Because certain modifica- 
tions may occur in succeeding generations due to either 
mutation or environmental infiuences, such progeny may 
not, in £act, be identical to the parent cell, but are still 
included within the scope of the term as used herein. A host 
cell can be any prokaryotic or eukaryotic cell. For example, 
a P2X4 polypeptide can be expressed in bacterial cells such 
as E. call, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. 

[0068] Vectors can be introduced into host cells via con- 
ventional transformation or transfection techniques. As used 

herein, the terms "transformation" and "transfection" are 
intended to refer to a variety of art-recognized techniques for 
introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipi- 
tation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. 

[0069] A host cell provided herein can be used to produce 
(i.e., express) a P2X4 polypeptide. Accordingly, further 
provided are methods for producing a P2X4 polypeptide 
using the host cells of the invention. In one embodiment, the 
method includes culturing host cells into which a recombi- 
nant expression vector encoding a P2X4 polypeptide has 
been introduced in a suitable medium such that a P2X4 
polypeptide is produced. In another embodiment, the 
method further includes isolating a P2X4 polypeptide from 
the medium or the host cell. 

[0070] Also provided are cells or purified preparations of 
cells which include a P2X4 transgene, or which otherwise 
misexpress P2X4 polypeptide. Cell preparations can consist 
of human or non-human cells, e.g., rodent cells, e.g., mouse 
or rat cells, rabbit cells, or pig cells. In preferred embodi- 
ments, the ceU or cells include a P2X4 transgene (e.g., a 
heterologous form of a P2X4 such as a human gene 
expressed in non-human cells). The P2X4 transgene can be 
misexpressed, e.g., overexpressed or underexpressed. In 
other preferred embodiments, the cell or cells include a gene 
which misexpress an endogenous P2X4 polypeptide (e.g., 
expression of a gene is dismpted, also known as a knockout). 
Such cells can serve as a model for studying disorders which 
are related to mutated or mis-expressed P2X4 alleles or for 
use in drug screening. Also provided are himian cells (e.g., 
a hematopoietic stem cells) transformed with a P2X4 nucleic 
acid. 

[0071] Also provided are cells or a piuified preparation 
thereof (e.g., himian cells) in which an endogenous P2X4 
nucleic add is under the control of a regulatory sequence 
that does not normally control the expression of the endog- 
enous P2X4 gene. The expression characteristics of an 
endogenous gene within a cell (e.g., a cell line or microor- 
ganism) can be modified by inserting a heterologous DNA 
regulatory element into the genome of the cell such that the 
inserted regulatory element is operably linked to the endog- 
enous P2X4 gene. For example, an endogenous P2X4 gene 
(e.g., a gene which is "transcriptionally silent," not normally 
expressed, or expressed only at very low levels) may be 
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activated by inserting a regulatory element which is capable 
of promoting the expression of a normally expressed gene 
product in that ceU. Techniques such as targeted homologous 
recombinations, can be used to insert the heterologous DNA 
as described in, e.g., Chappel, U.S. Pat. No. 5,272,071; WO 
91/06667, published on May 16, 1991. 

[0072] Transgenic Animals 

[0073] Non-human transgenic animals that express a het- 
erologous P2X4 polypeptide (e.g., expressed from a P2X4 
nucleic acid isolated from another organism) can be gener- 
ated. Such animals are useful for studying the function 
and/or activity of a P2X4 polypeptide and for identifying 
and/or evaluating modulators of P2X4 nucleic add and 
P2X4 polypeptide activity. As used herein, a "toinsgenic 
animal" is a non-human animal such as a mammal (e.g., a 
non-human primate such as chimpanzee, baboon, or 
macaque; an ungulate such as an equine, bovine, or caprine; 
or a rodent such as a rat, a mouse, or an Israeli sand rat), a 
bird (e.g., a chicken or a turkey), an amphibian (e.g., a frog, 
salamander, or newt), or an insect (e.g., drosophila melano- 
gaster), in which one or more of the cells of the animal 
includes a P2X4 transgene. A transgene is exogenous DNA 
or a rearrangement (e.g., a deletion of endogenous chromo- 
somal DNA) that often is integrated into or occurs in the 
genome of cells in a hransgenic animal. A transgene can 
direct expression of an encoded gene product in one or more 
cell types or tissues of the transgenic animal, and other 
transgenes can reduce expression (e.g., a knockout). Thus, a 
transgenic animal can be one in which an endogenous P2X4 
gene has been altered by homologous recombination 
between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal (e.g., an 
embryonic cell of the animal) prior to development of the 
animaL 

[0074] Intronic sequences and polyadenylation signals can 
also be included in the transgene to increase expression 
efficiency of the transgene. One or more tissue-specific 
regulatory sequences can be operably linked to a P2X4 
transgene to direct expression of a P2X4 polypeptide to 
particular cells. A transgenic founder animal can be identi- 
fied based upon the presence of a P2X4 transgene in its 
genome and/or expression of P2X4 mRNAin tissues or cells 
of the animals. A transgenic foimder animal can then be used 
to breed additional animals carryii^ the transgene. More- 
over, transgenic animals carrying a transgene encoding a 
P2X4 polypeptide can further be bred to other transgenic 
animals carrying other transgenes. 

[0075] P2X4 polypeptides can be expressed in transgenic 
animals or plants by introducing, for example, a nucleic acid 
encoding the polypeptide into the genome of an animal. In 
preferred embodiments the nucleic acid is placed under the 
control of a tissue specific promoter, e.g., a milk or egg 
speci&c promoter, and recovered &om the milk or eggs 
produced by the animal. Also included is a population of 
ceUs from a trani^enic animal. 

[0076] P2X4 Polypeptides 

[0077] Also featured herein are isolated P2X4 polypep- 
tides, which include a polypeptide having the amino add 
sequence of SEQ ID NO: 2 or SEQ ID NO: 3, P2X4 polypep- 
tide variants, and substantially identical polypeptides 
thereof. A P2X4 polypeptide variant is a polypeptide 


encoded by a P2X4 nucleic acid variant. An "isolated" or 
"purified" polypeptide or protein is substantially free of 
cellular material or other contaminating proteins from the 
cell or tissue source from which the protein is derived, or 
substantially free from chemical precursors or other chemi- 
cals when chemically synthesized. In one embodiment, the 
language "substantially free" means preparation of a P2X4 
polypeptide or P2X4 polypeptide variant having less than 
about 30%, 20%, 10% and more preferably 5% (by dry 
weight), of non-P2X4 polypeptide (also referred to herein as 
a "contaminating protein"), or of chemical precursors or 
non-P2X4 chemicals. When the P2X4 polypeptide or a 
biologically active portion thereof is recombinantly pro- 
duced, it is also preferably substantially free of ctilture 
medium, specifically, where culture medium represents less 
than about 20%, sometimes less than about 10%, and often 
less than about 5% of the volume of the polypeptide prepa- 
ration- Isolated or purified P2X4 polypeptide preparations 
are sometimes 0.01 miUigrams or more or 0.1 milligrams or 
more, and often 1.0 milligrams or more and 10 miUigrams 
or more in dry weight. 

[0078] Further included herein are P2X4 polypeptide frag- 
ments. The polypeptide fragment may be a domain or part of 
a domain of a P2X4 polypeptide. P2X4 domains include, but 
are not limited to, an ATP P2X receptor domain at about 
amino acid positions 15 to 361 of SEQ ID NO:2. The 
polypeptide fragment may have increased, decreased or 
unexpected biological activity. The polypeptide fragment 
often is 50 or fewer, 100 or fewer, or 200 or fewer amino 
adds in length, and is sometimes 300 or 388 or fewer amino 
adds in length. 

[0079] Substantially identical polypeptides may depart 

from the amino acid sequence of SEQ ID NO: 2 in different 
manners. For example, conservative amino acid modifica- 
tions may be introduced at one or more positions in the 
amino acid sequence of SEQ ID NO:2 or SEQ ID NO:3. A 
"conservative amino acid substitution" is one in which the 
amino acid is replaced by another amino acid having a 
similar structure and/or chemical function. Families of 
amino add residues having similar structures and functions 
are well known. These families include amino acids with 
basic side chains (e.g., ly,sine, arginine, histidine), acidic 
side chains (e.g., aspartic acid, glutamic acid), uncharged 
polar side chams (e.g., glydne, a^aragine, glutamine, 
serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, phenyla- 
lanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). Also, 
essential and non-essential amino acids may be replaced. A 
"non-essential" amino acid is one that can be altered without 
abohshing or substantially altering the biological function of 
a P2X4 polypeptide, whereas altering an "essential" amino 
acid aboUshes or substantially alters the biological function 
of a P2X4 polypeptide. Amino acids that are conserved 
among purinergic receptors (e.g., P2X1, P2X2, P2X3, 
P2X4, P2X5, P2X6, and P2X7) are typically essential amino 
adds. 

[0080] Also, P2X4 polypeptides and polypeptide variants 
may exist as chimeric or fusion polypeptides. As used 
herein, a P2X4 "chimeric polypeptide" or "fusion polypep- 
tide" includes a P2X4 polypeptide linked to a non-P2X4 
polypeptide. A"non-P2X4 polypeptide" refers to a polypep- 
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tide having an amino acid sequence corresponding to a 
polypeptide which is not substantially identical to the P2X4 
polypeptide, which includes, for example, a polypeptide that 
is different from the P2X4 polypeptide and derived from the 
same or a different organism. The P2X4 polypeptide in the 
fusion polypeptide can correspond to an entire or nearly 
entire P2X4 polypeptide or a fragment thereof. The non- 
P2X4 polypeptide can be fused to the N-terminus or C-ter- 
minus of the P2X4 polypeptide. 

[0081] Fusion polypeptides can include a moiety having 
high affinity for a ligand. For example, the fusion polypep- 
tide can be a GST-P2X4 fusion polypeptide in which the 
P2X4 sequences are fused to the C-terminus of the GST 
sequences, or a polyhistLdine-P2X4 fusion polypeptide in 
which the P2X4 polypeptide is fused at the N- or C-terminus 
to a string of histidloe residues. Such fusion polypeptides 
can facilitate purification of recombinant P2X4. Expression 
vectors are commercially available that already encode a 
fusion moiety (e.g., a GST polypeptide), and a P2X4 nucleic 
acid can be cloned into an expression vector such that the 
fusion moiety is hnked in-frame to the P2X4 polypeptide. 
Further, the fusion polypeptide can be a P2X4 polypeptide 
containing a heterologous signal sequence at its N-terminus. 
In certain host cells (e.g., mammalian host cells), expres- 
sion, secretion, cellular internalization, and cellular local- 
ization of a P2X4 polypeptide can be increased through use 
of a heterologous signal sequence. Fusion polypeptides can 
also include all or a part of a serum polypeptide (e.g., an IgG 
constant region or human serum albumin). 

[0082] P2X4 polypeptides can be incorporated into phar- 
maceutical compositions and administered to a subject in 
vivo. Administration of these P2X4 polypeptides can be 
used to affect the bioavailability of a P2X4 substrate and 
may effectively increase P2X4 biological activity in a cell. 
P2X4 fusion poljrpeptides may be useful therapeutically for 
the treatment of disorders caused by, for example, (i) aber- 
rant modification or mutation of a gene encoding a P2X4 
polypeptide; (ii) mis-regulation of the P2X4 gene; and (iii) 
aberrant post-translational modification of a P2X4 polypep- 
tide. Also, P2X4 polypeptides can be used as immunogens 
to produce anti-P2X4 antibodies in a subject, to purify P2X4 
ligands or binding partners, and in screening assays to 
identify molecides which inhibit or enhance the interaction 
of P2X4 with a P2X4 substrate. P2X4 polypeptides also 
sometimes are detected in a sample from a subject in a 
method for diagnosing or prognosing fat deposition or 
diabetes in the subject. In such embodiments, an alteration 
is detected in a P2X4 polypeptide that distinguishes it from 
P2X4 polypeptide in a subject not predisposed or diagnosed 
with fat deposition related disorders, such as NIDDM. Such 
alterations are detected in a variety of manners, which 
include, for example, contacting the sample with an anti- 
body that specifically binds to a P2X4 polypeptide or altered 
P2X4 polypeptide, or sequencing regions of the polypeptide. 

[0083] Substantially Identical P2X4 Nucleic Acids and 
Polypeptides 

[0084] P2X4 nucleotide sequences and P2X4 polypeptide 
sequences that are substantially identical to the nucleotide 
sequence of SEQ ID NO:l and the polypeptide sequence of 
SEQ ID NO:2. respectively, are included herein. The term 
"substantially identical" as used herein refers to two or more 
nucleic acids or polypeptides sharing one or more identical 


nucleotide sequences or polypeptide sequences, respec- 
tively. Included are nucleotide sequences or polypeptide 
sequences that are 55%, 60%, 65%, 70%, 75%, 80%, 85%, 
90%, 95% (each often within a 1%, 2%, 3% or 4% vari- 
ability) identical to the P2X4 nucleotide sequence in FIG. 1 
(SEQ ID NO:l) or the P2X4 polypeptide sequence of FIG. 
3A (SEQ ID NO:2). One test for determining whether two 
nucleic acids are substantially identical is to determine the 
percent of identical nucleotide sequences or polypeptide 
sequences shared between the nucleic acids or polypeptides. 

[0085] Calculations of sequence identity often are per- 
formed as follows. Sequences are aligned for optimal com- 
parison purposes (e.g., gaps can be introduced in one or both 
of a first and a second amino acid or nucleic acid sequence 
for optimal alignment and non-homologous sequences can 
be disregarded for comparison purposes). The length of a 
reference sequence aligned for comparison purposes is 
sometimes 30% or more, 40% or more, 50% or more, often 
60% or more, and more often 70%, 80%, 90%, 100% of the 
length of the reference sequence. The nucleotides or amino 
acids at corresponding nucleotide or polypeptide positions, 
respectively, are then compared among the two sequences. 
When a position in the first sequence is occupied by the 
same nucleotide or amino acid as the corresponding position 
in the second sequence, the nucleotides or amino acids are 
deemed to be identical at that position. The percent identity 
between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into 
account the number of gaps, and the length of each gap, 
introduced for optimal alignment of the two sequences. 

[0086] Comparison of sequences and determination of 
percent identity between two sequences can be accom- 
plished using a mathematical algorithm. Percent identity 
between two amino acid or nucleotide sequences can be 
determined using the algorithm of E. Meyers and W. Miller, 
CABIOS 4: 11-17 (1989), which has been incorporated into 
the ALIGN program (version 2.0), using a PAM 120 weight 
residue table, a gap length penalty of 12 and a gap penalty 
of 4. Also, percent identity between two amino acid 
sequences can be determined using the Needleman and 
Wunsch, J. Mol Biol 48: 444-453 (1970) algorithm which 
has been incorporated into the GAP program in the GCG 
software package (available at the http address www.gcg- 
.com), using either a Blossum 62 matrix or a PAM250 
matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a 
length weight of 1, 2, 3, 4, 5, or 6. Percent identity between 
two nucleotide sequences can be determined using the GAP 
program in the GCG software package (available at http 
address www.gcg.com), using a NWSgapdna.CMP matrix 
and a gap weight of 40, 50, 60, 70, or 80 and a length weight 
of 1, 2, 3, 4, 5, or 6. A set of parameters often used is a 
Blossum 62 scoring matrix with a gap open penalty of 12, 
a gap extend penalty of 4, and a frameshift gap penalty of 5. 

[0087] Another manner for determining if two nucleic 
acids are substantially identical is to assess whether a 
polynucleotide homologous to one nucleic acid will hybrid- 
ize to the other nucleic acid under stringent conditions. As 
use herein, the term "stringent conditions" refers to condi- 
tions for hybridization and washing. Stringent conditions are 
known to those skilled in the art and can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y., 
6.3.1-6.3.6 (1989). Aqueous and non-aqueous methods are 
described in that reference and either can be used. An 
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example of stringent hybridization conditions is hybridiza- 
tion in 6x sodium chloride/sodium citrate (SSC) at about 45** 
C, followed by one or more washes in 0.2xSSC, 0.1% SDS 
at 50** C. Another example of stringent hybridization con- 
ditions are hybridization in 6x sodium chloride/sodium 
citrate (SSC) at about 45° C, followed by one or more 
washes in 0.2xSSC, 0.1% SDS at 55° C. A further example 
of stringent hybridization conditions is hybridization in 6x 
sodium chloride/sodium citrate (SSC) at about 45"* C, 
followed by one or more washes in 0.2xSSC, 0.1% SDS at 
60° C. Often, stringent hybridization conditions are hybrid- 
ization in 6x sodium chloride/sodium citrate (SSC) at about 
45° C, followed by one or more washes in 0.2xSSC, 0.1% 
SDS at 65° C. More often, stringency conditions are 0.5M 
sodium phosphate, 7% SDS at 65° C, followed by one or 
more washes at 0.2xSSC, 1% SDS at 65° C. 

[0088] An example of a substantially identical nucleotide 
sequence to SEQ ID NO:l is one that has a different 
nucleotide sequence and still encodes the pol3q)eptide 
sequence of SEQ ID NO:2 or SEQ ID NO:3. Another 
example is a nucleotide sequence that encodes a polypeptide 
having a polypeptide sequence that is more than 70% 
identical to, sometimes more than 75%, 80%, or 85% 
identical to, and often more than 90% and 95% identical to 
the polypeptide sequence of SEQ ID NO: 2 or SEQ ID NO:3. 

[0089] P2X4 nucleotide sequences and polypeptide 

sequences can be used as "query sequences" to perform a 
search against pubhc databases to identify other family 
members or related sequences, for example. Such searches 
can be performed using the NBLAST and XBLAST pro- 
grams (version 2.0) of Altschul et al., J. Mol. Biol 215: 
403-10 (1990). BLAST nucleotide searches can be per- 
formed with the NBLAST program, score=100, 
wordlength=12 to obtain nucleotide sequences homologous 
to P2X4 nucleic acid molecules. BLAST polypeptide 
searches can be performed with the XBLAST program, 
score=50, wordlength=3 to obtain amino acid sequences 
homologous to P2X4 polypeptides. To obtain gapped align- 
ments for comparison purposes. Gapped BLAST can be 
utilized as described in Altschul et al.. Nucleic Acids Res. 
25(17): 3389-3402 (1997). When utilizing BLAST and 
Gapped BLAST programs, default parameters of the respec- 
tive programs (e.g., XBLAST and NBLAST) can be used 
(see the ht^ address www.ncbi.nlm.nih.gov). 

[0090] A nucleic add that is substantially identical to the 
nucleotide sequence of SEQ ID NO:l may include poly- 
morphic sites at positions equivalent to those described 
herein (e.g., position 11030 in SEQ ID NO:l) when the 
sequences are aligned. For example, using the alignment 
procedures described herein, SNPs in a sequence substan- 
tially identical to the sequence of SEQ ID NO:l can be 
identified at nucleotide positions that match (i.e., align) with 
nucleotides at SNP positions in SEQ ID NO:l. Also, where 
a polymorphic variation is an insertion or deletion, insertion 
or deletion of a nucleotide sequence from a reference 
sequence (e.g., a GGGCCCC insertion between positions 
2878 to 2891 of SEQ ID NO:l) can change the relative 
positions of other polymorphic sites in the nucleotide 
sequence. For example, a GGGCCCC insertion within posi- 
tions 2878 to 2891 in SEQ ID NO:l will shift a SNP at 
position 11030 in SEQ ID NO;l to position 11037. Also, a 
P2X4 nucleotide sequence sometimes includes an insert 


des:^nated in FtG. 5, which can alter the nucleotide num- 
bering designations after position 19137. 

[0091] Substantially identical P2X4 nucleotide and 
polypeptide sequences include those that are naturally 
occurring, such as allelic variants (same locus), splice vari- 
ants, homologs (difEerent locus), and orthologs (difiBerent 
organism) or can be non-naturally occurring. Non-naturaUy 
occurring variants can be generated by mutagenesis tech- 
niques, including those applied to poljmucleotides, cells, or 
organisms. The variants can contain nucleotide substitu- 
tions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. 
The variations can produce both conservative and non- 
conservative amino acid substitutions (as compared in the 
encoded product). Orthologs, homologs, allelic variants, and 
splice variants can be identified using methods known in the 
art. These variants normally comprise a nucleotide sequence 
encoding a polypeptide that is 50%, about 55% or more, 
often about 70-75% or more, more often about 80-85% or 
more, and typically about 90-95% or more identical to the 
amino acid sequence shown in SEQ ID NO:2 or a fragment 
thereof. Such nucleic acid molecules can readily be identi- 
fied as being able to hybridize under stringent conditions to 
the nucleotide sequence shown in SEQ ID NO:l or a 
fragment of this sequence. Nucleic acid molecules corre- 
sponding to orthologs, homologs, and allehc variants of the 
P2X4 nucleotide sequence can further be identified by 
mapping the sequence to the same chromosome or locus as 
the P2X4 nucleotide sequence or variant. 

[0092] Also, substantially identical P2X4 nucleotide 
sequences may include codons that are altered with respect 
to the naturally occurring sequence for enhancing expression 
of a P2X4 polypeptide or polypeptide variant in a particular 
expression system. For example, the nucleic acid can be one 
in which one or more codons are altered, and often 10% or 
more or 20% or more of the codons are altered for optimized 
expression in bacteria (e.g., E. coli.y, yeast (e.g., S. cerve- 
siae), human (e.g., 293 cells), insect, or rodent (e.g., ham- 
ster) celk. 

[0093] Fat Deposition Disorder Prognostic and Diagnostic 
Methods 

[0094] Methods for prognosing and diagnosing fat depo- 
sition, its related disorders (e.g., obesity and NIDDM) and 
leanness in subjects are provided herein. These methods 

include detecting the presence or absence of one or more 
polymorphic variations in a P2X4 nucleotide sequence or 
substantially identical sequence thereof in a sample from a 
subject, where the presence of a polymorphic variant 
described herein is indicative of a predisposition to leanness 
or fat deposition or one or more fat deposition related 
disorders (e.g., obesity or NIDDM). Determining a predis- 
position to fat deposition refers to determining whether an 
individual is at an increased or intermediate risk of fat 
deposition and determining a predisposition to leanness 
refers to a decreased risk of fat deposition. Determining a 
predisposition to NIDDM refers to determining whether an 
individual is at risk of NIDDM. 

[0095] Thus, featured herein is a method for detecting a 
predisposition to fat deposition and a fat deposition disorder, 

such as obesity and NIDDM, in a subject, which comprises 
detecting the presence or absence of a polymorphic variation 
associated with fat deposition at a polymorphic site in a 
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P2X4 nucleotide sequence in a nucleic acid sample from a 
subject, wherein the P2X4 nucleotide sequence comprises a 
polynucleotide sequence selected from the group consisting 
of: (a) the nucleotide sequence of SEQ ID NO:l; (b) a 
nucleotide sequence which encodes a polypeptide consisting 
of the amino acid sequence of SEQ ID NO:2 or SEQ ID 
NO:3; (c) a nucleotide sequence which encodes a polypep- 
tide that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3 or a nucleotide sequence about 
90% or more identical to the nucleotide sequence of SEQ ID 
NO:l; and (d) a fragment of a nucleotide sequence of (a), 
(b), or (c) comprising the polymorphic site; whereby the 
presence of the polymorphic variation is indicative of a 
predisposition to fax deposition in the subject. In certain 
embodiments, polymorphic variants at positions 11030, 
15847 and 17338 are detected for determining a predispo- 
sition to fat deposition, a polymorphic variant at position 
15847 is detected for determining a predisposition to 
NIDDM and polymorphic variants at positions in linkage 
disequilibrium with these positions are detected for deter- 
mining a predisposition to fat deposition and NIDDM. 
Polymorphic variants within a region spanning from about 
position 11000 to about position 25300 in SEQ ID NO: 1 or 
a substantially identical nucleic acid sometimes are identi- 
fied to assess whether a subject is predisposed to fat depo- 
sition and fat deposition disorders (e.g., obesity and 
NIDDM) as many polymorphic variants in this region are in 
linkage disequilibrium. 

[0096] Results from prognostic tests may be combined 
with other test results to diagnose fat deposition related 

disorders, including NIDDM. For example, prognostic 
results may be gathered, a patient sample may be ordered 
based on a determined predisposition to fat deposition or 
NIDDM, the patient sample is analyzed, and the results of 
the analysis may be utilized to diagnose the fat deposition 
related condition (e.g., NIDDM). Also fat deposition diag- 
nostic methods can be developed from studies used to 
generate prognostic methods in which populations are strati- 
fied into subpopulations having different progressions of a 
fat deposition related disorder or condition. 

[0097] Predisposition to fat deposition, fat deposition 
related disorders such as NIDDM and obesity, and leanness 
sometimes is expressed as a probabihty, such as an odds 
ratio, percenta^, or risk factor. The predisposition is based 
upon the presence or absence of one or more polymorphic 
variants described herein, and also may be based in part 
upon phenotypic traits of the individual being tested. Meth- 
ods for calculating predispositions based upon patient data 
are well known (see, e.g., Agresti, Categorical Data Analy- 
sis^, 2nd Ed. 2002. WUey). Allelo typing and geno typing 
analyses may be carried out in populations other than those 
exemplified herein to enhance the predictive power of the 
prognostic method. These further analyses are executed in 
view of the exemplified procedures described herein, and 
may be based upon the same polymorphic variations or 
additional polymorphic variations. 

[0098] The nucleic acid sample typically is isolated from 
a biological sample obtained from a subject. For example, 
nucleic acid can be isolated from blood, saliva, sputum, 
urine, cell scrapings, and biopsy tissue. The nucleic acid 

sample can be isolated from a biological sample using 
standard techniques, such as the technique described in 
Example 2. As used herein, the term "subject" refers pri- 


marily to humans but also refers to other mammals such as 
dogs, cats, and ungulates (e.g., cattle, sheep, and swine). 
Subjects also include avians (e.g., chickens and turkeys), 
reptiles, and fish (e.g., salmon), as embodiments described 
herein can be adapted to nucleic acid samples isolated from 
any of these organisms. The nucleic acid sample may be 
isolated from the subject and then directly utihzed in a 
method for determining the presence of a polymorphic 
variant, or alternatively, the sample may be isolated and then 
stored (e.g., frozen) for a period of time before being 
subjected to analysis. 

[0099] The presence or absence of a pol5m3orphic variant 

is determined using one or both chromosomal complements 
represented in the nucleic acid sample. Determining the 
presence or absence of a polymorphic variant in both 
chromosomal complements represented in a nucleic acid 
sample from a subject having a copy of each chromosome is 
useful for determining the zygosity of an individual for the 
polymorphic variant (i.e., whether the individual is homozy- 
gous or heterozygous for the polymorphic variant). Any 
oligonucleotide -based diagnostic may be utilized to deter- 
mine whether a sample includes the presence or absence of 
a polymorphic variant in a sample. For example, primer 
extension methods, l^ase sequence determination methods 
(e.g., U.S. Pat. Nos. 5,679,524 and 5,952,174, and WO 
01/27326), mismatch sequence determination methods (e.g., 
U.S. Pat. Nos. 5,851,770; 5,958,692; 6,110,684; and 6,183, 
958), micro array sequence determination methods, restric- 
tion fragment length polymorphism (RFLP), sii^e strand 
conformation polymorphism detection (SSCP) (e.g., U.S. 
Pat. Nos. 5,891,625 and 6,013,499), PCR -based assays (e.g., 
TAQMAN® PCR System (Applied Biosystems)), and 
nucleotide sequencing methods may be used. 

[0100] Oligonucleotide extension methods typically 
involve providing a pair of oligonucleotide primers in a 
polymerase chain reaction (PCR) or in other nucleic acid 
amplification methods for the purpose of amplifying a 
region from the nucleic acid sample that comprises the 
polymorphic variation. One oligonucleotide primer is 
complementary to a region 3' of the polymorphism and the 
other is complementary to a region 5' of the polymorphism. 
A PCR primer pair may be used in methods disclosed in U.S. 
Pat. Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 
5,998,143; 6,140,054; WO 01/27327; and WO 01/27329 for 
example. PCR primer pairs may also be used in any com- 
mercially available machines that perform PCR, such as any 
of the GENEAMPCg) Systems available from AppHed Bio- 
systems. Also, those of ordinary skill in the art will be able 
to design oligonucleotide primers based upon the nucleotide 
sequence of SEQ ID NO:l without undue experimentation 
using knowledge readily available in the art. 

[0101] Also provided is an extension oligonucleotide that 
hybridizes to the amplified fragment adjacent to the poly- 
morphic variation. As used herein, the term "adjacent" refers 
to the 3' end of the extension oligonucleotide being often 1 
nucleotide from the 5' end of the polymorphic site, and 
sometimes 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides from the 
5' end of the polymorphic site, in the nucleic acid when the 
extension oligonucleotide is hybridized to the nucleic acid. 
The extension oligonucleotide then is extended by one or 
more nucleotides, and the number and/or type of nucleotides 
that are added to the extension oligonucleotide determine 
whether the polymorphic variant is present. Oligonucleotide 
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extension methods are disclosed, for example, in U.S. Pat. 
Nos. 4,656,127; 4,851,331; 5,679,524; 5,834,189; 5,876, 
934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004, 
744; 6,013,431; 6,017,702; 6,046,005; 6,087,095; 6,210, 
891; and WO 01/20039. Oligonucleotide extension methods 
using mass spectrometry are described, for example, in U.S. 
Pat. Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242, 5,928,906; 6,043,031, and 6,194,144, and a 
method often utilized is described herein in Example 2. 

[0102] A micro array can be utilized for determining 
whether a polymorphic variant is present or absent in a 
nucleic acid sample. A microarray may Include any oligo- 
nucleotides described herein, and methods for making and 
using oligonucleotide micro arrays suitable for diagnostic 
use are disclosed in U.S. Pat. Nos. 5,492,806; 5,525,464; 
5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 
6,136,541; 6,142,681; 6,156,501; 6,197,506; 6,223,127; 
6,225,625; 6,229,911; 6,239,273; WO 00/52625; WO 
01/25485; and WO 01/29259. The microarray typicaUy 
comprises a solid support and the oligonucleotides may be 
linked to this solid support by covalent bonds or by non- 
covalent interactions. The oligonucleotides may also be 
linked to the solid support directly or by a spacer molecule, 
A microarray may comprise one or more oligonucleotides 
complementary to a polymorphic site of SEQ ID NO:l (e.g., 
positions 11030, 15847 and 17338). 

[0103] A kit also may be utilized for determining whether 

a polymorphic variant is present or absent in a nucleic acid 
sample. A kit often comprises one or more pairs of ohgo- 
nucleotide primers useful for amplifying a fragment of SEQ 
ID NO:l or a substantially identical sequence thereof, where 
the fragment includes a polymorphic site. The kit sometimes 
comprises a polymerizing agent, for example, a thermo- 
stable nucleic acid polymerase such as one disclosed in U.S. 
Pat. Nos. 4,889,818 or 6,077,664. Also, the kit often com- 
prises an elongation oligonucleotide that hybridizes to a 
P2X4 nucleic acid in a nucleic acid sample adjacent to the 
polymorphic site. Where the kit includes an elongation 
oligonucleotide, it also often comprises chain elongating 
nucleotides, such as dATP, dTTP, dGTP, dCTP, and dITP, 
including analogs of dATP, dTTP, dGTP, dCTP and dITP, 
provided that such analog;s are substrates for a thermostable 
nucleic acid polymerase and can be incorporated into a 
nucleic acid chain elongated from the extension oligonucle- 
otide. Along with chain elongating nucleotides would be one 
or more chain terminating nucleotides such as ddATP, 
ddTTP, ddOTP, ddCTP, and the Hke. In an embodiment, the 
kit comprises one or more oligonucleotide primer pairs, a 
polymerizing agent, chain elongating nucleotides, at least 
one elongation oHgonucleotide, and one or more chain 
terminating nucleotides. Kits optionally include buffers, 
vials, microliter plates, and instructions for use. 

[0104] Determining the presence of a polymorphic vari- 
ant, or a combination of two or more polymorphic variants, 
in a P2X4 nucleic acid of the sample often is indicative of 
a predisposition to fat deposition or leanness. For example, 
presence of an adenine at position 11030 of SEQ ID NO:l 
in the sense strand of a P2X4 nucleotide sequence is 
associated with an increased risk of fat deposition and 
presence of a thymine at position 11030 of SEQ ID NO:l in 
the sense strand of a P2X4 nucleotide sequence is associated 
with leanness or a decreased risk of fat deposition. Specifi- 
cally, a subject homozygous for an adenine at position 11030 


of SEQ ID NO:l in the sense strands of the P2X4 nucleotide 
sequence is at a higher risk of fat deposition, a subject 
heterozygous for an adenine and thymine at position 11030 
in the sense strands of the P2X4 nucleotide sequence is at an 
intermediate risk of increased fat deposition, and a subject 
homozygous for a thymine at position 11030 in the sense 
strands of the P2X4 nucleotide sequence is at a lower risk of 
fat deposition. Similarly, a subject homozygous for a gua- 
nine at position 11030 in the strands complementary to the 
sense strands of the P2X4 nucleotide sequence is at a higher 
risk of increased fat deposition, a subject heterozygous for 
a guanine and adenine at position 11030 in the strands 
complementary to the sense strands of the P2X4 nucleotide 
sequence is at an intermediate risk of increased fat deposi- 
tion, and a subject homozygous for an adenine at position 
11030 in the strands complementary to the sense strands of 
the P2X4 nucleotide sequence is at a decreased risk of fat 
deposition. 

[0105] Also, the presence of a thymine at position 15847 
of SEQ ID NO:l in the sense strand of a P2X4 nucleotide 

sequence is associated with an increased risk of fat deposi- 
tion and NIDDM and the presence of a cytosine at position 
15847 in the sense strand of a P2X4 nucleotide sequence is 
associated with leanness or a decreased risk of fat deposition 
and NIDDM. Specifically, a subject homozygous for a 
thymine at position 15847 of SEQ ID NO:l in the sense 
strands of the P2X4 nucleotide sequence is at a higher risk 
of increased fat deposition and NIDDM, a subject heterozy- 
gous for a thymine and cytosine at position 15847 in the 
sense strands of the P2X4 nucleotide sequence is at an 
intermediate risk of increased fat deposition and NIDDM, 
and a subject homozygous for a cytosine at position 15847 
in the sense strands of the P2X4 nucleotide sequence is at a 
decreased risk of fat deposition and NIDDM. Similarly, a 
subject homozygous for an adenine at position 15847 in the 
strands complementary to the sense strands of the P2X4 
nucleotide sequence is at a higher risk of increased fat 
deposition and NIDDM, a subject heterozygous for a gua- 
nine and adenine at position 15847 in the strands comple- 
mentary to the sense strands of the P2X4 nucleotide 
sequence is at an intermediate risk of increased fat deposi- 
tion and NIDDM, and a subject homozygous for an guanine 
at position 15847 in the strands complementary to the sense 
strands of the P2X4 nucleotide sequence is at a lower risk of 
fat deposition and NIDDM. 

[0106] Also, the presence of a cytosine at position 17338 
of SEQ ID NO:l in the sense strand of a P2X4 nucleotide 
sequence is associated with an increased risk of fat deposi- 
tion and the presence of a thymine at position 17338 in the 
sense strand of a P2X4 nucleotide sequence is associated 
with leanness or a decreased risk of fat deposition. Specifi- 
cally, a subject homozygous for a cytosine at position 17338 
of SEQ ID NO:l in the sense strands of the P2X4 nucleotide 
sequence is at a higher risk of increased fat deposition, a 
subject heterozygous for a cytosine and thymine at position 
17338 in the sense strands of the P2X4 nucleotide sequence 
is at an intermediate risk of increased fat deposition, and a 
subject homozygous for a thymine at position 17338 in the 
sense strands of the P2X4 nucleotide sequence is at a 
decreased risk of fat deposition. Similarly, a subject 
homozygous for a guanine at position 17338 in the strands 
complementary to the sense strands of the P2X4 nucleotide 
sequence is at a higher risk of increased fat deposition, a 
subject heterozygous for a guanine and adenine at position 
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17338 in the strands complementary to the sense strands of 
the P2X4 nucleotide sequence is at an intermediate risk of 
increased fat deposition, and a subject homozygous for an 
adenine at position 17338 in the strands complementary to 
the sense strands of the P2X4 nucleotide sequence is at a 
lower risk of fat deposition. 

[0107] Also, the presence of a haplotypes of TTAT and 
TCAT at positions 11030, 15847, 21708, and 22713, respec- 
tively, in the sense strand of a P2X4 nucleotide sequence 
(SEQ ID NO: 1) are associated with leanness or a decreased 
risk of fat deposition. Similarly, the presence of a haplotype 
of AATAand AGTA at positions 11030, 15847, 21708, and 
22713, respectively, in the strand complementary to the 
sense strand of a P2X4 nucleotide sequence are associated a 
decreased risk of fat deposition. Haplotypes of ACAT and 
ACGC at positions 11030, 15847, 21708, and 22713, respec- 
tively, in the sense strand of a P2X4 nucleotide sequence are 
associated with an increased risk of fat deposition, as well as 
the presence of the haplotypes TGTAand TGCG at positions 
11030, 15847, 21708, and 22713, respectively, in the strand 
complementary to the sense strand of a P2X4 nucleotide 
sequence. 

[0108] Applications of Prognostic and Diagnostic Results 
to Fharmacogenomic Methods 

[0109] Pharmacogenomics is a discipline that involves 
tailoring a treatment for a subject according to the subject's 
genotype as a particular treatment regimen may exert a 
differential effect depending upon the subject's genotype. 
Based upon the outcome of a prognostic test described 
herein, a clinician or physician may target pertinent infor- 
mation and preventative or therapeutic treatments to a 
subject who would be benefited by the information or 
treatment and avoid directing such information and treat- 
ments to a subject who would not be benefited (e.g., the 
treatment has no therapeutic effect and/or the subject expe- 
riences adverse side effects). 

[0110] For example, where a candidate therapeutic exhib- 
its a significant interaction with a major allele and a com- 
paratively weak interaction with a minor allele (e.g. an order 
of magnitude or greater difference ia the interaction), such a 
therapeutic typically would not be administered to a subject 
genotyped as being homozygous for the minor allele, and 
sometimes not administered to a subject genotyped as being 
heterozygous for the minor allele. In another example, 
where a candidate therapeutic is not significantly toxic when 
administered to subjects who are homozygous for a major 
allele but is comparatively toxic when administered to 
subjects heterozygous or homozygous for a minor allele, the 
candidate therapeutic is not typically administered to sub- 
jects who are genotyped as being heterozygous or homozy- 
gous with respect to the minor allele. 

[0111] The prognostic methods described herein are appli- 
cable to pharmacogenomic methods for preventing, allevi- 
ating or treating fat deposition conditions such as obesity 
and NIDDM. For example, a nucleic acid sample from an 
individual may be subjected to a prognostic test described 
herein. Where one or more polymorphic variations associ- 
ated wifli increased risk of obesity or NIDDM are identified 
in a subject, information for preventing or treating obesity or 
NIDDM and/or one or more obesity or NIDDM treatment 
regimens then may be prescribed to that subject. For 
example, a patient having a thymine at position 15847 in 


SEQ ID NO:l often is prescribed a preventative regimen 
designed to minimize the occurrence of obesity or NIDDM. 
[0112] In certain embodiments, a treatment regimen is 
specifically prescribed and/or administered to individuals 
who will most benefit from it based upon their risk of 
developing obesity or NIDDM assessed by the prognostic 
methods described herein. Thus, provided are methods for 
identifying a subject predisposed to obesity or NIDDM and 
then prescribing a therapeutic or preventative regimen to 
individuals identified as having a predisposition. Thus, cer- 
tain embodiments are directed to a method for reducing fat 
deposition, obesity or NIDDM in a subject, which com- 
prises: detecting the presence or absence of a polymorphic 
variant associated with fat deposition, obesity or NIDDM in 
a P2X4 nucleotide sequence in a nucleic acid sample from 
a subject, where the P2X4 nucleotide sequence comprises a 
polynucleotide sequence selected from the group consisting 
of; (a) the polynucleotide sequence of SEQ ID NO:l; (b) a 
polynucleotide sequence which encodes a polypeptide con- 
sisting of the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO:3; (c) a polynucleotide sequence which encodes a 
polypeptide that is 90% identical to the amino acid sequence 
of SEQ ID NO:2 or SEQ ID NO:3; and (d) a fragment of a 
polynucleotide sequence of (i), (ii), or (iii); and prescribing 
or administering a treatment regimen to a subject from 
whom the sample originated where the presence of a poly- 
morphic variation associated with fat reduction is detected in 
the P2X4 nucleotide sequence. In these methods, predispo- 
sition results may be utihzed in combination with other test 
results to diagnose fat deposition associated conditions, such 
as obesity, metabolic conditions (e.g., NIDDM) and cardio- 
vascular conditions (e.g., myocardial infarction). 
[0113] The treatment sometimes is preventative (e.g., is 
prescribed or administered to reduce the probability that a 
fat deposition associated condition arises or progresses), 
sometimes is therapeutic, and sometimes delays, alleviates 
or halts the progression of a fat deposition associated 
condition. Any known preventative or therapeutic treatment 
for alleviating or preventing the occurrence of a fat depo- 
sition associated disorder is prescribed and/or administered. 
For example, the treatment sometimes is or includes a drug 
that reduces fat deposition, including, for example, an appe- 
tite suppressant (e.g., Phentermine, Adipex, Bontril, Didrex, 
lonamin, Meridia, Phendimetrazine, Tenuate, Sibutramine), 
a Upase inhibitor (e.g., Olistat), a phosphoHpase inhibitor, a 
P2X4 nucleic acid, a P2X4 polypeptide, and/or a molecule 
that interacts with a P2X4 nucleic acid or P2X4 polypeptide 
described hereafter. In another example, the treatment is or 
includes a physical exercise regimen, dietary counseling 
and/or a dietary regimen (e.g., a low fat diet and/or a diet 
where the subject eats during pre -scheduled intervals) 
optionally coupled with dietary counseling, psychological 
counsehng and/or psychotherapy, and sometimes optionally 
coupled with prescription of a psychotherapeutic or psy- 
choprophylactic (e.g., an antidepressant or anti-anxiety 
therapeutic). In other embodiments directed to diabetes 
management, a subject sometimes is prescribed a regimen 
for regularly monitoring blood glucose levels, dietary coun- 
seling, a dietary regimen for managing blood glucose levels, 
and/or a blood glucose altering drug regimen. Examples of 
blood glucose altering drug regimens are regular adminis- 
tration of insulin (e.g., injection, pump, inhaler spray, nasal 
spray, insuhn patch, and insulin tablet), and administration 
of hypoglycemics (e.g., glyburide or repaglinide), starch 
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blockers (e.g., acarbose), liver glucose regulating agents 
(e.g., metformin), and/or insulin sensitizers (e.g., rosiglitza- 
one or pioglitazone). Prescription and/or administration of 
each treatment or combinations of treatments often is depen- 
dent upon the age of the subject as well as the subject's 
physiological, medical, and/or psychological condition. 

[0114] In an embodiment, the pharmacogenomic methods 
described herein are applicable to subjects who are women 
about forty or more years of age and have not yet entered 
menopause, undei^oing menopause, or post-menopausal. 
Those subjects identified as having an increased risk for fat 
deposition sometimes are prescribed a hormone replacement 
treatment (HRT) regimen. There are many HRT regimens 
known in the art, which include regular administration of 
estrogen (e.g., Pmmarin®), progesterone (e.g., Provera®), 
androgen (e.g., testosterone), a combination of estro^n and 
progesterone, a combination of estrogen and androgen (e.g., 
Estratest®), growth hormone, dehydroepiandrosterone 
(DHEA), a sulfate ester of DHEA, or a combination of 
DHEA and a DHEA sulfate ester. Also, selective estrogen 
receptor modulators (SERMs) such as raloxifene and tamox- 
ifen, for example, can be prescribed. Those women diag- 
nosed as having an increased risk of fat deposition some- 
times are prescribed an estrogen replacement therapy (ERT) 
regimen or SERMs regimen as an altemative to a combi- 
nation of estrogen and progesterone, due to an association 
between ERT and lower fat deposition and an association 
between increased fat deposition and progesterone replace- 
ment therapy. 

[0115] In another embodiment, pharmacogenomic meth- 
ods are applicable to subjects who are women using a 
contraceptive or are contemplating use of a contraceptive, 
where the contraceptive has been shown to increase fat 
deposition in subjects. This embodiment often applies to 
women who are pre-pubescent, who are in puberty, or who 
are post-pubescent and pre-menopausal. Many oral contra- 
ceptives, especially those that include higher contents of 
estrogen compared to other oral contraceptives, have been 
shown to increase fat deposition in subjects. Those subjects 
identified as having an increased risk for fat deposition by 
the methods described herein often are advised not to begin 
an oral contraceptive regimen or to discontinue an oral 
contraceptive regimen. Alternatively, subjects identified as 
having an increased risk for fat deposition sometimes are 
advised to begin an oral contraceptive regimen using a 
contraceptive having lower estrogen content as compared to 
other available oral contraceptives (e.g., Allesse®, LevKte®, 
Loestrin-Fe®, and Mircette© are examples of contracep- 
tives having lower estrogen content). 

[0116] As therapeutic approaches for obesity or NIDDM 
continue to evolve and improve, the goal of treatments for 
fat deposition related disorders is to intervene even before 

clinical signs (e.g., impaired glucose tolerance) first mani- 
fest. Thus, genetic markers associated with susceptibility to 
obesity or NIDDM prove useful for early diagnosis, pre- 
vention and treatment of obesity or NIDDM. 

[0117] As obesity or NIDDM preventative and treatment 
information can be specifically targeted to subjects ia need 
thereof (e.g. those at risk of developing obesity or NIDDM 
or those that have early stages of obesity or NIDDM), 
provided herein is a method for preventing or reducing the 
risk of developing obesity or NIDDM in a subject, which 


comprises: (a) detecting the presence or absence of a poly- 
morphic variation associated with obesity or NIDDM at a 
polymorphic site in a nucleotide sequence in a nucleic acid 
sample from a subject; (b) identifying a subject with a 
predisposition to obesity or NIDDM, whereby the presence 
of the polymorphic variation is indicative of a predisposition 
to obesity or NIDDM in the subject; and (c) if such a 
predisposition is identified, providing the subject with infor- 
mation about methods or products to prevent or reduce 
obesity or NIDDM or to delay the onset of obesity or 
NIDDM. Also provided is a method of targeting information 
or advertising to a subpopulation of a human population 
based on the subpopulation being genetically predisposed to 
a disease or condition, which comprises: (a) detecting the 
presence or absence of a polymorphic variation associated 
with obesity or NIDDM at a polymorphic site in a nucleotide 
sequence in a nucleic acid sample from a subject; (b) 
identifying the subpopulation of subjects in which the poly- 
morphic variation is associated with obesity or NIDDM; and 
(c) providing information only to the subpopulation of 
subjects about a particular product which may be obtained 
and consumed or applied by the subject to help prevent or 
delay onset of the disease or condition. 

[0118] Pharmacogenomics methods also may be used to 
analyze and predict a response to an obesity or NIDDM 
treatment or a drug. For example, if pharmacogenomics 
analysis indicates a likelihood that an individual will 
respond positively to a obesity or NIDDM treatment with a 
particular drug, ttie drug may be administered to the indi- 
vidual. Conversely, if the analysis indicates that an indi- 
vidual is likely to respond negatively to treatment with a 
particular drug, an alternative course of treatment may be 
prescribed. A negative response may be defined as either the 
absence of an efiScacious response or the presence of toxic 
side effects. The response to a therapeutic treatment can be 
predicted in a background study in which subjects in any of 
the following populations are ^notyped: a population that 
responds favorably to a treatment regimen, a population that 
does not respond significantly to a treatment regimen, and a 
population that responds adversely to a treatment regiment 
(e.g. exhibits one or more side effects). These populations 
are provided as examples and other populations and sub- 
populations may be analyzed. Based upon the results of 
these analyses, a subject is genotyped to predict whether he 
or she will respond favorably to a treatment regimen, not 
respond significantly to a treatment regimen, or respond 
adversely to a treatment regimen. 

[0119] The prognostic tests described herein also are 
applicable to clinical drug trials. One or more polymorphic 
variants indicative of response to an agent for treating 
obesity or NIDDM or to side effects to an agent for treating 
obesity or NIDDM may be identified using the methods 
described herein. Thereafter, potential participants in clini- 
cal trials of such an agent may be screened to identify those 
individuals most likely to respond favorably to the dmg and 
exclude those likely to experience side effects. In that way, 
the effectiveness of drug treatment may be measured in 
individuals who respond positively to the drug, without 
lowering the measurement as a result of the inclusion of 
individuals who are unlikely to respond positively in the 
study and without risking undesirable safety problems. 

[0120] Thus, another embodiment is a method of selecting 
an individual for inclusion in a clinical trial of a treatment or 
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drug comprising the steps of: a) obtaining a nucleic acid 
sample from an individual; b) determining the identity of a 

polymorphic variation which Ls associated with a positive 
response to the treatment or the drug, or at least one 
polymorphic variation which is associated with a negative 
response to the treatment or the drug in the nucleic acid 
sample, and c) including the individual in the clinical trial if 
the nucleic acid sample contains said polymorphic variation 
associated with a positive response to the treatment or the 
drug or if the nucleic acid sample lacks said polymorphic 
variation associated with a negative response to the treat- 
ment or the drug. In addition, the methods of the present 
invention for selecting an individual for inclusion in a 
clinical trial of a treatment or drug encompass methods with 
any further limitation described in this disclosure, or those 
following, specified alone or in any combination. The poly- 
morphic variation may be in a sequence selected individu- 
ally or in any combination from the group consisting of (i) 
a polynucleotide sequence set forth in SEQ ID NO: 1; (ii) a 
polynucleotide sequence that is 90% identical to a nucle- 
otide sequence set forth in SEQ ID NO: 1; (iii) a polynucle- 
otide sequence that encodes a polypeptide having an amino 
acid sequence identical to or 90% identical to an amino acid 
sequence encoded by a nucleotide sequence set forth in SEQ 
ID NO: 1; and (iv) a fragment of a polynucleotide sequence 
of (i), (ii), or (iii) comprising the polymorphic site. The 
including step (c) optionally comprises administering the 
drug or the treatment to the individual if the nucleic acid 
sample contains the polymorphic variation associated with a 
positive response to the treatment or the drug and the nucleic 
acid sample lacks said biallelic marker associated with a 
negative response to the treatment or the drug. 

[0121] Also provided herein is a method of partnering 
between a diagnostic/prognostic testing provider and a pro- 
vider of a consumable product, which comprises: (a) the 
diagnostic/prognostic testing provider detects the presence 
or absence of a polymorphic variation associated with 
obesity or NIDDM at a polymorphic site in a nucleotide 
sequence in a nucleic acid sample from a subject; (b) the 
diagnostic/prognostic testing provider identifies the sub- 
population of subjects in which the polymorphic variation is 
associated with obesity or NIDDM; (c) the diagnostic/ 
prognostic testing provider forwards information to the 
subpopulation of subjects about a particular product which 
may be obtained and consumed or applied by the subject to 
help prevent or delay onset of the disease or condition; and 
(d) the provider of a consumable product forwards to the 
diagnostic test provider a fee every time the diagnostic/ 
prognostic test provider forwards information to the subject 
as set forth in step (c) above. Methods for Identifying 
Candidate Therapeutics for Reducing Fat Deposition 

[0122] Current therapies for the treatment of NIDDM have 
limited efficacy, limited tolerabiUty and significant mecha- 
nisms-based side effects, including weight gain and 
hypoglycemia. Few of the available therapies adequately 
address underlying defects such as obesity and insuhn 
resistance. Thus, newer approaches are desperately needed 
(MoUer D. Nature. 414:821-827 (2001)). Current therapeu- 
tic approaches were largely developed in the absence of 
defined molecular targets or even a solid understanding of 
disease pathogenesis. The same holds true for the treatment 
of obesity, where treatments have limited lasting effects and 
many side effects. Therefore, there is a need for methods of 


identifying candidate therapeutics that target the biochemi- 
cal pathways related to the development of obesity and/or 
diabetes. 

[0123] Featured herein is a method for identifying candi- 
date therapeutics for reducing fat deposition, obesity and/or 
alleviating NIDDM. The method often comprises introduc- 
ing a test molecule to a system which comprises a nucleic 
acid comprising a P2X4 nucleotide sequence selected from 
the group consisting of: the nucleotide sequence of SEQ ID 
NO:l; a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 or 
SEQ ID NO: 3; a nucleotide sequence which encodes a 
polypeptide that is 90% identical to the amino acid sequence 
of SEQ ID NO:2 or SEQ ID NO:3 or a nucleotide sequence 
90% identical to the nucleotide sequence of SEQ ID NO:l; 
and a fragment of one of the foregoing nucleotide sequences. 
Alternatively, the method comprises introducing a test mol- 
ecule to a system which comprises a protein encoded by one 
of the foregoing nucleotide sequences. The presence or 
absence of an interaction between the test molecule and the 
nucleic acid or protein is determined and the presence of an 
interaction between the test molecule and the nucleic acid or 
protein identifies the test molecule as a candidate therapeutic 
for fat reduction, obesity and/or alleviating NIDDM. 

[0124] As used herein, the term "test molecule" and "can- 
didate therapeutic" refers to modulators that regulate tran- 
scription and translation of P2X4 nucleic acids and modu- 
lators of expression and activity of P2X4 polypeptides. The 
term "modulator" as used herein refers to a molecule which 
agonizes or antagonizes P2X4 DNArepHcation and/or DNA 
processing (e.g., methylation), P2X4 RNA transcription 
and/or RNA processing (e.g., removal of intronic sequences 
and/or translocation from the nucleus), P2X4 polypeptide 
production (e.g., translation of the polypeptide from mRNA, 
and/or post-translational modification such as glycosylation, 
phosphorylation, and proteolysis of pro-polypeptides), and/ 
or P2X4 function (e.g., conformational changes, binding of 
nucleotides or nucleotide analogs, binding and/or transloca- 
tion of ions, interaction with binding partners, effect on 
membrane potential, effect on fat deposition, effect on 
metabolic condition, and/or effect on cardivascular condi- 
tion). Test molecules and candidate therapeutics include, but 
are not limited to, compounds, antisense nucleic acids, 
siRNA molecules, ribozymes, P2X4 polypeptide or frag- 
ments thereof, immunotherapeutics (e.g., antibodies). 

[0125] Compoimds 

[0126] Compounds may be utilized as test molecules for 
identifying candidate therapeutics for reducing fat deposi- 
tion. Compounds can be obtained using any of the numerous 
approaches in combinatorial hbrary methods known in the 
art, including: biological Hbraries; peptoid hbraries (hbraries 
of moleciUes having the functionalities of peptides, but with 
a novel, non-peptide bacld^one which are resistant to enzy- 
matic degradation but which nevertheless remain bio active 
(see, e.g., Zuckermann, R. N. et al., J. Med. Chem. 37: 
2678-85 (1994)); spatially addressable parallel solid phase 
or solution phase libraries; synthetic library methods requir- 
ing deconvolution; "one-bead one-compound" library meth- 
ods; and synthetic library methods using afBnity chroma- 
tography selection. Biological library and peptoid library 
approaches are typically limited to peptide hbraries, while 
the other approaches are appUcable to peptide, non-peptide 
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oligomer or small molecule libraries of compounds (Lam, 
Anticancer Drug Des. 12: 145,(1997)). Examples of meth- 
ods for synthesizing molecular libraries are described, for 
example, in DeWitt et al., Proc. Natl. Acad. ScL U.SA. 90: 
6909 (1993); Erb et al.,Proc. Natl Acad. ScL USA 91: 11422 
(1994); Zuckermann et al., /. Med. Chem. 37: 2678 (1994); 
Cho et al., Science 261: 1303 (1993); Carrell et ^L.Angew. 
Chem. Int. Ed. Engl. 33. 2059 (1994); CareU et al., A«^ew. 
Chem. Int. Ed. Engl. 33: 2061 (1994); and in Gallop et al., 
J. Med. Chem. 37. 1233 (1994). 

[0127] Libraries of compounds may be presented in solu- 
tion (e.g., Houghten, Biotechnigues 13; 412-421 (1992)), or 
on beads (Lam, Nature 354: 82-84 (1991)), chips (Fodor, 
Nature 364: 555-556 (1993)), bacteria or spores (Ladner, 
U.S. Pat. No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. USA 89: 1865-1869 (1992)) or on phage (Scott 
and Smith, Science 249: 386-390 (1990); Devlin, Science 
249: 404-406 (1990); Cwirla et al., Prac. Natl. Acad. Sci. 87: 
6378-6382(1990); Felici,/. MoL Biol 222: 301-310(1991); 
Ladner supra.). 

[0128] Compounds may alter expression or activity of 
P2X4 polypeptides and may be a smaU molecule. Small 

molecules include, but are not limited to, peptides, pepddo- 
mimetics (e.g., peptoids), amino acids, amino acid analogs, 
polynucleotides, polynucleotide analogs, nucleotides, nucle- 
otide analogs, organic or inorganic compounds (i.e., includ- 
ing heteroorganic and organometallic compounds) having a 
molecular weight less than about 10,000 grams per mole, 
organic or inorganic compounds having a molecular weight 
less than about 5,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 1,000 
grams per mole, organic or inorganic compounds having a 
molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of 
such compounds. 

[0129] Antisense, Ribozyme. siRNA, and Modified P2X4 
Nucleic Acid Molecules 

[0130] Also featured herein are antisense, ribozyme, 
siRNA, and modified P2X4 nucleic acids for use as test 

molecules in methods for identifying candidate therapeutics 
for reducing fat deposition or treating NIDDM, and for use 
as therapeutics for reducing fat deposition or treating 
NIDDM in a subject. An "antisense" nucleic acid refers to 
a nucleotide sequence which is complementary to a "sense" 
nucleic acid encoding a polypeptide, e.g., complementary to 
the coding strand of a double-stranded cDNA molecule or 
complementary to an mRNA sequence. The antisense 
nucleic acid can be complementary to an entire P2X4 coding 
strand, or to only a portion thereof (e.g., the coding region 
of human P2X4 corresponding to SEQ ID NO: 1). In another 
embodiment, the antisense nucleic acid molecule is anti- 
sense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding P2X4 (e.g., 5' and 3' untrans- 
lated regions). 

[0131] An antisense nucleic acid can be designed such that 
it is complementary to the entire coding region of P2X4 
mRNA, and often the antisense nucleic acid is an oligo- 
nucleotide that is antisense to only a portion of a codii^ or 
noncoding region of P2X4 mRNA. For example, the anti- 
sense oligonucleotide can be complementary to the region 
surrounding the translation start site of P2X4 mRNA, e.g., 
between the -10 and +10 regions of the target gene nucle- 


otide sequence of interest. An antisense oligonucleotide can 
be, for example, about 7, 10, 15, 20, 25, 30, 35, 40, 45, 50, 
55, 60, 65, 70, 75, 80, or more nucleotides in length. The 
antisense nucleic acids, which include the ribozymes 
described hereafter, can be designed to target P2X4 nucleic 
acid or P2X4 nucleic acid variants. Among the variants, 
minor alleles and major alleles can be targeted, and those 
associated with a higher risk to fat deposition, such as alleles 
having an adenine at position 11030, a thymine at position 
15847 or a cytosine at position 17338 in the P2X4 nucleotide 
sequence represented by SEQ ID NO:l, often are designed, 
tested, and administered to subjects. Similarly, antisense 
nucleic acids, which include the ribozymes described here- 
after, can be designed to target P2X4 nucleic acid or P2X4 
nucleic acid variants associated with a higher risk of devel- 
oping NIDDM, such as nucleic acids having a thymine at 
positions 15847 of SEQ ID NO:l. An antisense nucleic acid 
can be constructed using chemical synthesis and enzymatic 
ligation reactions using standard procedures. For example, 
an antisense nucleic acid (e.g., an antisense oligonucleotide) 
can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to 
increase the biological stability of the molecules or to 
increase the physical stability of the duplex formed between 
the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 
Antisense nucleic acid also can be produced biolc^caUy 
using an expression vector into which a nucleic acid has 
been subcloned in an antisense orientation (i.e., RNA tran- 
scribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described 
further in the following subsection). 

[0132] Antisense nucleic acids are typically administered 
to a subject (e.g., by direct injection at a tissue site) or 
generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a P2X4 
polypeptide and thereby inhibit expression of the polypep- 
tide, for example, by inhibiting transcription and/or trans- 
lation. Alternatively, antisense nucleic acid molecules can be 
modified to target selected cells and then administered 
systemicaUy. For systemic administration, antisense mol- 
ecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, for 
example, by linking antisense nucleic acid molecules to 
peptides or antibodies which bind to cell surface receptors or 
antigens. Antisense nucleic acid molecules can also be 
delivered to cells using the vectors described herein. SuflS- 
cient intracellular concentrations of antisense molecules are 
achieved by incorporating a strong promoter, such as a pol 
II or pol III promoter, in the vector construct. 

[0133] Antisense nucleic acid molecules are sometimes 
a-anomeric nucleic acid molecules. An ct-anomeric nucleic 
acid molecule forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual p-units, 
the strands run parallel to each other (Gaultier et al., Nucleic 
Acids. Res. 15: 6625-6641 (1987)). Antisense nucleic acid 
molecules can also coniprise a 2'-o-methylribonucleotide 
(Inoue et SLl.,Nucleic Acids Res. 15: 6131-6148 (1987)) or a 
chimeric RNA-DNA analogue (Inoue et al., FEBSLett. 215: 
327-330 (1987)). 

[0134] In another embodiment, tin cuitisense nucleic acid is 
a ribozyme. A ribozyme having specificity for a P2X4- 
encoding nucleic acid can include one or more sequences 
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complementary to the nucleotide sequence of a P2X4 DNA 
sequence disclosed herein (e.g., SEQ ID NO:l), and a 
sequence having a known catalytic sequence responsible for 
mRNA cleavage (see U.S. Pat. No. 5,093,246 or HaseUioff 
and Gerlach, Nature 334: 585-591 (1988)). For example, a 
derivative of a Tetrahymena L-19 IVS RNA is sometimes 
utilized in which the nucleotide sequence of the active site 
is complementary to the nucleotide sequence to be cleaved 
in a P2X4-encoding mRNA. See, e.g., Cech et al. U.S. Pat. 
No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742. 
Also, P2X4 mRNA can be used to select a catalytic RNA 
having a specific ribonuclease activity Izom a pool of RNA 
molecules. See, e.g., Bartel & Szostak, Science 261: 1411- 
1418 (1993). 

[0135] P2X4 gene expression can be inhibited by targeting 
nucleotide sequences complementary to the regulatory 
region of the P2X4 (e.g., P2X4 promoter and/or enhancers) 
to form triple helical structures that prevent transcription of 
the P2X4 gene in target cells. See, H&l&n&, Anticancer Drug 
Des. 6(6): 569-84 (1991); Helene et al., Anw. N.Y.Acad. ScL 
660: 27-36 (1992); and Maher, Bioassays 14(12): 807-15 
(1992). Potential sequences that can be targeted for triple 
helix formation can be increased by creating a so-called 
''switchback" nucleic acid molecule. Switchbadc molecules 
are synthesized in an alternating 5'-3', 3-5' manner, such that 
they base pair with first one strand of a duplex and then the 
other, ehminating the necessity for a sizeable stretch of 
either purines or pyrimidines to be present on one strand of 
a duplex. 

[0136] P2X4 expression may be inhibited by the introduc- 
tion of double-stranded RNA (dsRNA), whictt induces 
potent and specific gene silencing, a phenomenon called 

RNA interference or RNAi. See, e.g.. Fire et al., U.S. Pat. 
No. 6,506,559; Tuschl et al. PCX International Publication 
No. WO 01/75164; Kay et al. PCX International Publication 
No. WO 03/010180A 1; or Bosher J M, Labouesse,iVflr Cell 
Biol February 2000; 2(2) :E3 1-6. Xhis process has been 
improved by decreasing the size of the double -stranded 
RNA to 20-24 base pairs (to create small-interferit^ RNAs 
or siRNAs) that "switched off" genes in mammalian cells 
without initiating an acute phase response, i.e., a host 
defense mechanism that often results in cell death. See, e.g., 
Caplen et al. Proc Natl Acad Sci USA. Aug. 14, 
2001;98(17):9742-7 and Elbashir S M et al. Methods Feb- 
ruary 2002;26(2): 199-213. There is increasing evidence that 
post-transcriptional gene silencing by RNA interference 
(RNAi) for inhibiting targeted expression in mammalian 
cells at the mRNA level is effective in-human cells. Xhere is 
additional evidence of effective methods for inhibiting the 
proliferation and migration of tumor cells in human patients, 
and for inhibiting metastatic cancer development. See, e.g., 
U.S. Patent Application Number US200 1000993 183; 
Caplen N J et al. Proc Natl Acad Sci USA; and Abderrah- 
mani A. et al. Mot Cell Biol Nov. 21, 2001(21):7256-67. 

[0137] An "siRNA" or "RNAi" refers to a nucleic acid 
that forms a double stranded RNA and has the ability to 
reduce or inhibit expression of a gene or target gene when 
the siRNAis dehvered to or expressed in the same cell as the 
gene or target gene. "siRNA" thus refers to short double 
stranded RNA formed by the complementary strands. 
Complementary portions of the siRNA that hybridize to 
form the double stranded molecule often have substantial or 
complete identity to the target molecule sequence. In one 


embodiment, an siRNA refers to a nucleic acid that has 
substantial or complete identity to a target gene and forms a 
double stranded siRNA, such as a nucleotide sequence in 
SEQ ID NO: 1, for example. 

[0138] When designing the siRNA molecules, the targeted 
region often is selected &om a given DNA sequence begin- 
ning 50 to 100 nt downstream of the start codon. See, e.g., 
Elbashir et al. Methods 26:199-213 (2002). Initially, 5' or 3' 
UTRs and regions nearby the start codon were avoided 
assimning that UTR-binding proteins and/or translation ini- 
tiation complexes may interfere with binding of the siRNP 
or RISC endonuclease complex. Sometimes regions of the 
target 23 nucleotides in length conforming to the sequence 
motif AA(N19)XX (N, an nucleotide), and regions with 
approximately 30% to 70% G/C-content (often about 50% 
G/C-content) often are selected. If no suitable sequences are 
found, the search often is extended using the motif 
NA(N21). The sequence of the sense siRNA sometimes 
corresponds to (N19) TT or N21 (position 3 to 23 of the 
23-nt motif), respectively. In the latter case, the 3' end of the 
sense siRNA often is converted to TT. The rationale for this 
sequence conversion is to generate a symmetric duplex with 
respect to the sequence composition of the sense and anti- 
sense 3' overhangs. The antisense siRNA is synthesized as 
the complement to position 1 to 21 of the 23-nt motif. 
Because position I of the 23-nt motif is not recognized 
sequence -specifically by the antisense siRNA, the 3 '-most 
nucleotide residue of the antisense siRNA can be chosen 
deliberately. However, the penultimate nucleotide of the 
antisense slRNA (complementary to position 2 of the 23-nt 
motif) often is complementary to the targeted sequence. For 
simphfying chemical sjoithesis, TT often is utilized. siRNAs 
corresponding to the target motifNAR(N17)YNN, where R 
is purine (A,G) and Y is pyrimidine (C,U), often are 
selected. Respective 21 nucleotide sense and antisense siR- 
NAs often begin with a purine nucleotide and can also be 
expressed from pol III expression vectors without a change 
in targeting site. Expression of RNAs from pol III promoters 
often i& efEicient when the first transcribed nucleotide is a 
purine. 

[0139] The sequence of the siRNA can correspond to the 
filll length target gene, or a subsequence thereof. Often, the 
siRNA is about 15 to about 50 nucleotides in length (e.g., 
each complementary sequence of the double stranded siRNA 
is 15-50 nucleotides in length, and the double stranded 
siRNAis about 15-50 base pairs-in length, sometimes about 
20-30 nucleotides in length or about 20-25 nucleotides in 
length, e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 
nucleotides in length. The siRNA often is about 21 nucle- 
otides in length. Methods of using siRNA are well known in 
the art, and specific siRNA molecules may be purchased 
from a number of companies including Dharmacon 
Research, Inc. 

[0140] Antisense, ribozyme, and modified P2X4 nucleic 
acid molecules can be altered at base moieties, sugar moi- 
eties or phosphate backbone moieties to improve stability, 
hybridization, or solubiUty of the molecule. For example, 
the deoxyribose phosphate backbone of nucleic acid mol- 
ecules can be modified to generate peptide nucleic acids (see 
Hyrup et al., Bioorganic & Medicinal Chemistry 4 (1). 5-23 
(1996)). As used herein, the terms "peptide nucleic acid" or 
"PNA" refers to a nucleic acid mimic such as a DNA mimic, 
in which the deoxyribose phosphate backbone is replaced by 
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a pseudopeptide backbone and only the four natural nucleo- 
bases are retained. The neutral backbone of a PNAcan allow 
for specific hybridization to DNAand RNA under conditions 
of low ionic strength. Synthesis of PNA ohgomers can be 
performed using standard solid phase peptide synthesis 
protocols as described, for example, in Hyrup et al., (1996) 
supra and Perry-O'Keefe et al., Proc. Natl. Acad. Sci. 93: 
14670-675 (1996). 

[0141] PNAs of P2X4 nucleic acids can be used in thera- 
peutic and diagnostic applications. For example, PNAs can 
be used as antisense or aatigeae agents for sequence-specific 
modulation of gene expression by, for example, inducing 

transcription or translation arrest or inhibiting replication. 
PNAs of P2X4 nucleic acid molecules can also be used in 
the analysis of single base pair mutations in a gene, (e.g., by 
PNA-directed PGR clamping); as "artificial restriction 
enzymes" when used in combination with other enzymes, 
(e.g., SI nucleases (Hynip (1996) supra)); or as probes or 
primers for DNA sequencing or hybridization (Hynip et al., 
(1996) supra; Perry-O'Keefe supra). 

[0142] In other embodiments, ohgonucleotides may 
include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo), or agents facilitating 
transport across cell membranes (see, e.g., Letsinger et al., 
Proc. Natl. Acad. Sci. USA 86: 6553-6556 (1989); Lemaihre 
et aL, Proc. Natl Acad. Set USA 84: 648-652 (1987); PCT 
Publication No. WO88/09810) or the blood-brain barrier 
(see, e.g., PCT Publication No. WO89/10134). In addition, 
oligonucleotides can be modified with hybridization-trig- 
gered cleavage agents (See, e.g., Krol et dX., Bio-Techniques 
6: 958-976 (1988)) or intercalating agents, (See, e.g., Zon, 
Pharm. Res. 5. 539-549 (1988)). To this end, the oligonucle- 
otide may be conjugated to another molecule, (e.g., a 
peptide, hybridization triggered cross-linking agent, trans- 
port agent, or hybridization-triggered cleavage agent). 

[0143] Also included herein are molectilar beacon oligo- 
nucleotide primer and probe molecules having one or more 
regions which are complementary to a P2X4 nucleic acid of 
the invention, two complementary regions one having a 
fluorophore and one a quencher such that the molecular 
beacon is usefiil for quantifying the presence of the P2X4 
nucleic acid of the invention in a sample. Molecular beacon 
nucleic acids are described, for example, in Lizardi et al., 
U.S. Pat. No. 5,854,033; Nazarenko et al, U.S. Pat. No. 
5,866,336, and Livak et al, U.S. Pat. No. 5,876,930. 

[0144] Anti-P2X4 Antibodies 

[0145] In an embodiment, antibodies are screened as test 
molecules and used as therapeutics for reducing fat deposi- 
tion or treating NIDDM in a subject. The term "antibody" as 
used herein refers to an immunoglobulin molecule or immu- 
nologically active portion thereof, i.e., an antigen-binding 
portion. Examples of immunologically active portions of 
immimoglobuUn molecules include F(ab) and F(ab')2 frag- 
ments which can be generated by treating the antibody with 
an enzyme suc^ as pepsin. An antibody can be a polyclonal, 
monoclonal, recombinant, e.g., a chimeric or humanized, 
fully human, non-human, e.g., murine, or single chain 
antibody. An antibody may have effector function and can 
fix complement, and is sometimes coupled to a toxin or 
imaging agent. 

[0146] A fuU-length P2X4 polypeptide or, antigenic pep- 
tide fragment of P2X4 can be used as an immunogen or can 


be used to identify anti-P2X4 antibodies made with other 
immimogens, e.g., cells, membrane preparations, and the 
like. The antigenic peptide of P2X4 should include at least 
8 amino acid residues of the amino acid sequence shown in 
SEQ ID NO:2 and encompasses an epitope of P2X4. Anti- 
genic p^tides include 10 or more amino adds, 15 or more 
amino acids, often 20 or more amino acids, and typically 30 
or more amino acids. Hydrophilic and hydrophobic frag- 
ments of P2X4 poljfpeptides can be used as rmmunogens. 

[0147] Epitopes encompassed by the antigenic peptide are 
regions of P2X4 located on the surface of the polypeptide 
(e.g., hydrophilic regions) as well as regions with high 
antigenicity. For example, an Emini surface probability 
analysis of the human P2X4 polypeptide sequence can be 
used to indicate the regions that have a particularly high 
probability of being localized to the surface of the P2X4 
polypeptide and are thus likely to constitute surface residues 
useful for targeting antibody production. The antibody may 
bind an epitope on any domain or region on P2X4 polypep- 
tides described herein. 

[0148] Also, chimeric, humanized, and completely human 
antibodies are useful for applications which include repeated 
administration to subjects. Chimeric and humanized mono- 
clonal antibodies, comprising both human and non-human 
portions, can be made using standard recombinant DNA 
techniques. Such chimeric and humanized monoclonal anti- 
bodies can be produced by recombinant DNA techniques 
known in the art, for example using methods described in 
Robinson et al. Intemational Application No. PCT/US86/ 
02269; Akira, et al. European Patent Application 184,187; 
Taniguchi, M., European Patent Apphcation 171,496; Mor- 
rison et al. European Patent Apphcation 173,494; Neuberger 
et al. PCT International Pubhcation No. WO 86/01533; 
Cabilly et al. U.S. Pat. No. 4,816,567; Cabilly et al. Euro- 
pean Patent Application 125,023; Better et al, Science 240: 
1041-1043 (1988); Liu et al., Proc. Natl Acad. Sci. USA 84: 
3439-3443 (1987); Liu et al., J. Immunol. 139: 3521-3526 
(1987); Sun et al., Proc. Natl Acad. Sci. USA 84: 214-218 
(1987); Nishimura et al.. Cane. Res. 47: 999-1005 (1987); 
Wood et al. Nature 314: 446-449 (1985); and Shaw et al, 
J. Natl. Cancer Inst. 80: 1553-1559 (1988); Morrison, S. L., 
Science 229: 1202-1207 (1985); Oi et al., BioTechniques 4: 
214 (1986); Winter U.S. Pat. No. 5,225,539; Jones et al.. 
Nature 321. 552-525 (1986); Verhoeyan et al., Science 239: 
1534; and Beidler et al., J. Immunol 141: 4053-4060 (1988). 

[0149] Completely human antibodies are particularly 

desirable for therapeutic treatment of human patients. Such 
antibodies can be produced using transgenic mice that are 
incapable of expressing endogenous immunoglobuhn heavy 
and light chains genes, but which can express human heavy 
and fight chain genes. See, for example, Lonberg and 
Huszar,/M?. Rev. Immunnl. 13: 65-93 (1995); and U.S. Pat. 
Nos. 5,625,126; 5,633,425; 5,569,825; 5,661,016; and 
5,545,806. In addition, companies sadti as Abgenix, Inc. 
(Fremont, Calif.) and Medarex, Inc. (Princeton, N.J.), can be 
engaged to provide human antibodies directed against a 
selected antigen using technology similar to that described 
above. Completely human antibodies that recognize a 
selected epitope also can be generated using a technique 
referred to as "guided selection." In this approach a selected 
non-human monoclonal antibody (e.g., a murine antibody) is 
used to guide the selection of a completely human antibody 
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recognizing the same epitope. This technology is described 
for example by Jespers et al., Bio/Technology 12. 899-903 
(1994). 

[0150] An anti-P2X4 antibody can be a single chain 
antibody. A single chain antibody (scFV) can be engineered 
(see, e.g., Colcher, D. et al.^ Arm. NY Acad. Sci. 880: 263-80 
(1999); and Reiter, Y., Clin. Cancer Res. 2: 245-52 (1996)). 
Single chain antibodies can be dimerized or multimerized to 
generate multivalent antibodies having specificities for dif- 
ferent epitopes of the same target P2X4 polypeptide. 

[0151] Antibodies also may be selected or modified so that 
they exhibit reduced or no ability to bind an Fc receptor. For 
example, an antibody may be an isotype or subtype, frag- 
ment or other mutant, which does not support binding to an 
Fc receptor (e.g., it has a mutagenized or deleted Fc receptor 
binding region). 

[0152] Also, an antibody (or fragment thereof) may be 
conjugated to a therapeutic moiety such as a cjrtotoxin, a 
therapeutic agent or a radioactive metal ion. A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to 
cells. Examples include taxol, cytochalasin B, gramicidin D, 
ethidium bromide, emetine, mitomycin, etoposide, tenopo- 
side, vincristine, vinblastine, colchicin, doxorubicin, dauno- 
rubicin, dihydroxy anthracin dione, mitoxantrone, mithra- 
mycin, actinomycin D, 1-dehydro testosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, 
and puromycin and analogs or homologs thereof. Therapeu- 
tic agents include, but are not limited to, antimetabolites 
(e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cyt- 
arabine, 5-fiuorouracil decarbazine), alkylating agents (e.g., 
mechlorethamine, thioepa chlorambucil, melphalan, car- 
mustine (BSNU) and lomustine (CCNU), cyclothospha- 
mide, busulfan, dibromomannitol, streptozotocin, mitomy- 
cin C, and cis-dichlorodiamine platinum (II) (DDP) 
cisplatin), anthracyclines (e.g., daunombicin (formerly 
daunomycin) and doxorubicin), antibiotics (e.g., dactinomy- 
cin (formerly actinomycin), bleomycin, mithramycin, and 
anthramycin (AMC)), and anti-mitotic agents (e.g., vincris- 
tine and vinblastine). 

[0153] Antibody conjugates can be used for modifying a 
given biological response. For example, the drug moiety 
may be a protein or polypeptide possessing a desired bio- 
logical activity. Such proteins may include, for example, a 
toxin such as abrin, ricin A, pseudomonas exotoxin, or 
diphtheria toxin; a polypeptide such as tumor necrosis 
factor, c(.-interferon, P -interferon, nerve growth factor, plate- 
let derived growth factor, tissue plasminogen activator; or, 
biological response modifiers such as, for example, lym- 
phokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), 
interleukin-6 ("lL-6"), granulocyte macrophage colony 
stimulating factor ("GM-CSF"), granulocyte colony stimu- 
lating factor ("G-CSF"), or other growth factors. Also, an 
antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S- Pat. 
No. 4,676,980, for example. 

[0154] An anti-P2X4 antibody (e.g., monoclonal anti- 
body) can be used to isolate P2X4 polypeptides by standard 
teclmiques, such as affinity chromatography or immunopre- 
cipitation. Moreover, an anti-P2X4 antibody can be used to 

detect a P2X4 polypeptide (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abimdance and pattern 
of expression of the polypeptide. Anti-P2X4 antibodies can 


be used diagnostically to monitor polypeptide levels in 
tissue as part of a clinical testing procedure, e.g., to deter- 
mine the efiBcacy of a given treatment regimen. Detection 
can be facilitated by couphng (i.e., physically linking) the 
antibody to a detectable substance (i.e., antibody labeling). 
Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent mate- 
rials, bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish peroxi- 
dase, alkaline phosphatase, |3-galac^osidase, or acetylcho- 
linesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluo- 
rescein, fluorescein isothiocyanate, rhodamine, dichlorotri- 
azinylamine fluorescein, dansyl chloride or phycoerythrin; 
an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, 
luciferin, and aequorin, and examples of suitable radioactive 
material include ^^l, ^^S or ^H. 

[0155] An antibody can be made by immunizing with a 
purified P2X4 antigen, or a fragment thereof, e.g., a frag- 
ment described herein, a membrane associated antigen, 
tissues, e.g., crude tissue preparations, whole cells, prefer- 
ably living cells, lysed cells, or cell fractions. 

[0156] Included herein are antibodies which bind only a 
native P2X4 polypeptide, only denatured or otherwise non- 
native P2X4 polypeptide, or which bind both, as well as 

those having linear or conformational epitopes. Conforma- 
tional epitopes sometimes can be identified by selecting 
antibodies that bind to native but not denatured P2X4 
polypeptide. 

[0157] Screening Assays 

[0158] As used herein, the term "system" refers to a cell 
free in vitro environment and a cell-based environment such 
as a collection of cells, a tissue, an organ, or an organism. A 
system is contacted or introduced to a test molecule in a 
variety of manners, including adding molecules in solution 
and allowing them to interact with one another by diffiisioo, 
cell injection, and any administration routes in an animal. As 
used herein, the term "interaction" refers to an effect of a test 
molecule on a P2X4 nucleic acid, polypeptide, or variant 
thereof (collectively referred to as a "P2X4 molecule"), 
where the effect is sometimes binding between the test 
molecule and the nucleic acid or polypeptide, and often is an 
observable change in cells, tissue, or organism. There are 
many standard methods for detecting the presence or 
absence of interaction between a test molecule and a P2X4 
nucleic acid or polypeptide, as described hereafter. 

[0159] An interaction can be determined by labeling the 
test molecule and/or the P2X4 molecule, where the label is 
covalently or non-covalently attached to the test molecule or 
P2X4 molecule. The label is sometimes a radioactive mol- 
ecule such as ^^S, ^"^C, or ^H, which can be detected by 
direct counting of radioemmission or by scintillation count- 
ing. Also, enzymatic labels such as horseradish peroxidase, 
alkaline phosphatase, or luciferase may be utiHzed where the 
enzymatic label can be detected by determining conversion 
of an appropriate substrate to product. Also, presence or 
absence of an interaction can be determined without label- 
ing. For example, a microphysiometer (e.g., Cytosensor) is 
an analytical instrument that measures the rate at which a 
cell acidifies its environment using a light-addressable 
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potentiometric sensor (LAPS). Changes in this acidification 
rate can be used as an indication of an interaction between 
a test molecule and P2X4 (McConnell, H, M, et al,. Science 
257: 1906-1912 (1992)). 

[0160] In cell-based systems, cells typically include a 
P2X4 nucleic acid or polypeptide or variants thereof and 
often are of mammalian origin, although the cell can be of 
any origin. Whole cells, cell homogenates, and ceU fractions 
(e.g., cell membrane fractions) can be subjected to analysis. 
Where interactions between a test molecule with a P2X4 
polypeptide or variant thereof are monitored, soluble and/or 
membrane bound forms of the polypeptide or variant may be 
utilized. Where membrane -bound forms of the polypeptide 
are used, it may be desirable to utilize a solubilizing agent. 
Examples of such solubilizing agents include non-ionic 
detergents such as n-octylglucoside, n-dodecylglucoside, 
n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® 
X-114, Thesit®, Isotridecypoly(ethylene glycol ether)^^, 
3 -[(3-cholam idop rop y l)dimethy lamminio ] - 1 -propane sul- 
fonate (CHAPS), 3-[(3-cholamidopropyl)dimethylain- 
minio]-2-hydrQxy-l-propane sulfonate (CHAPS O), or 
N-dodecyl-N,N-dimethyl-3-ammonio- 1 -propane sulfonate. 

[0161] An interaction between two molecules can also be 
detected by monitoring fluorescence energy transfer (FET) 
(see, for example, Lakowicz et al., U.S. Pat. No. 5,631,169; 
Stavrianopoulos, et al., U.S. Pat. No. 4,868,103). A fluoro- 
phore label on a first, "donor" molecule is selected such that 
its emitted fluorescent energy will be absorbed by a fluo- 
rescent label on a second, "acceptor" molecule, which in 
turn is able to fluoresce due to the absorbed energy. Alter- 
nately, the "donor" polypeptide molecule may simply utilize 
the natural fluorescent energy of tryptophan residues. Labels 
are chosen that emit different wavelengths of light, such that 
the "acceptor" molecule label may be differentiated from 
that of the "donor". Since the efficiency of energy transfer 
between the labels is related to the distance separating the 
molecules, the spatial relationship between the molecules 
can be assessed. In a situation in which binding occurs 
between the molecules, the fluorescent emission of the 
"acceptor" molecule label in the assay should be maximal. 
An FET binding event can be conveniently measured 
through standard fluorometric detection means well known 
in the art (e.g., using a fluorimeter). 

[0162] In another embodiment, determining the presence 
or absence of an interaction between a test molecule and a 
P2X4 molecule can be effected by using real-time Biomo- 
lecular Interaction Analysis (BIA) (see, e.g., Sjolander, S. 
and Urbaniczky, C.,AnaL Chem. 63: 2338-2345 (1991) and 
Szabo et al., Curr. Opin. Struct. BioL 5: 699-705 (1995)). 
"Surface plasmon resonance" or "BIA" detects biospeciflc 
interactions in real time, without labeling any of the inter- 
actants (e.g., BIAcore). Changes in the mass at the binding 
surface (indicative of a binding event) result in alterations of 
the refractive index of light near the surface (the optical 
phenomenon of surface plasmon resonance (SPR)), resulting 
in a detectable s^nal which can be used as an indication of 
real-time reactions between biological molecules. 

[0163] In another embodiment, the P2X4 molecule or test 

molecules are anchored to a solid phase. The P2X4 mol- 
ecule/test molecule complexes anchored to the solid phase 
can be detected at the end of the reaction. The target P2X4 


molecule often is anchored to a solid surface, and the test 
molecule, which is not anchored, can be labeled, either 
directly or indirectly, with detectable labels discussed 

herein. 

[0164] It may be desirable to immobilize a P2X4 mol- 
ecule, an anti-P2X4 antibody, or test molecules to facilitate 
separation of complexed from uncomplexed forms of P2X4 
molecules and test molecules, as weU as to accommodate 
automation of the assay. Binding of a test molecule to a 
P2X4 molecule can be accomplished in any vessel suitable 
for containing the reactants. Examples of such vessels 
include microtiter plates, test tubes, and micro -centrifuge 
tubes. In one embodiment, a fusion polypeptide can be 
provided which adds a domain that allows a P2X4 molecule 
to be bound to a matrix. For example, glutathione-S-trans- 
ferase/P2X4 fusion polypeptides or glutathione-S-trans- 
ferase/target fusion polypeptides can be adsorbed onto glu- 
tathione sepharose beads (Sigma Chemical, St. Louis, Mo.) 
or glutathione derivatized microtiter plates, which are then 
combined with the test compound or the test compound and 
either the non- adsorbed target polypeptide or P2X4 polypep- 
tide, and the mixture incubated under conditions conducive 
to complex formation (e.g., at physiological conditions for 
salt and pH). Following incubatioa, the beads or microtiter 
plate wells are washed to remove any unbound components, 
the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as 
described above. Alternatively, the complexes can be disso- 
ciated from the matrix, and the level of P2X4 binding or 
activity determined using standard techniques. 

[0165] Other techniques for immobilizing a P2X4 mol- 
ecule on matrices include using biotin and streptavidin. For 
example, biotinylated P2X4 polypeptide or target molecules 
can be prepared from biotin-NHS(N-hydroxy-succinimide) 
using techniques known in the art (e.g., biotinylation kit. 
Pierce Chemicals, Rockford, lU.), and immobilized in the 
wells of strep tavidin-coated 96 well plates (Pierce Chemi- 
cal). 

[0166] In order to conduct the assay, the non-immobilized 

component is added to the coated surface containing the 
anchored component. After the reaction is complete, unre- 
acted components are removed (e.g., by washing) under 
conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection of com- 
plexes anchored on the solid surface can be accomplished in 
a number of ways. Where the previously non-immobilized 
component is pre-labeled, the detection of label immobilized 
on the surface indicates that complexes were formed. Wliere 
the previously non-immobilized component is not pre-la- 
beled, an indirect label can be used to detect complexes 
anchored on the surface; e.g., using a labeled antibody 
specific for the immobilized component (the antibody, in 
turn, can be directly labeled or indirectly labeled with, e.g., 
a labeled anti-Ig antibody). 

[0167] In one embodiment, this assay is performed utiliz- 
ing antibodies reactive with P2X4 polypeptide or test mol- 
ecules but which do not interfere with binding of the P2X4 
polypeptide to its test molecule. Such antibodies can be 
derivatized to the wells of the plate, and unbound target or 
P2X4 poljrpeptide trapped in the weUs by antibody conju- 
gation. Methods for detecting such complexes, in addition to 
those described above for the GST-immobilized complexes. 
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include immunodetection of complexes using antibodies 
reactive with the P2X4 polypeptide or target molecule, as 
well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the P2X4 polypeptide or 
test molecule. 

[0168] Alternatively, cell free assays can be conducted in 
a liquid phase. In such an assay, the reaction products are 
separated from unreacted components, by any of a number 
of standard techniques, including but not limited to: diflfer- 
ential centrifugation (see, for example, Rivas, G., and 
Minton, A. P., Trends Biochem Sci A«g;18(8): 284-7 
(1993)); chromatography (gel filtration chromatography, 
ion-exchange chromatography); electrophoresis (see, e.g., 
Ausubel, F. et al., eds. Current Protocols in Molecular 
Biology, J. Wiley: New York (1999)); and immimoprecipi- 
tation (see, for example, Ausubel, F. et al., eds. Current 
Protocols in Molecular Biology, J. Wiley: New York 
(1999)). Such resins and chromatographic techniques are 
known to one skilled in the art (see, e.g., Heegaard, N. H., 
J. Mol. Recognit. Winter; 11(1-6): 141-8 (1998); Hage, D. 
S., and Tweed, S. A., J. Chromatogr. B Biomed. Sci. Appl 
October 10; 699 (1-2): 499-525 (1997)). Further, fluores- 
cence enetgy transfer may also be conveniently utilized, as 
described herein, to detect binding without further purifica- 
tion of the complex from solution. 

[0169] In another embodiment, modulators of P2X4 
expression are identified. For example, a cell or cell free 
mixture is contacted with a candidate compound and the 
expression of P2X4 mRNAor polypeptide evaluated relative 
to the level of expression of P2X4 mRNA or polypeptide in 
the absence of the candidate compound. When expression of 
P2X4 mRNAor polypeptide is greater in the presence of the 
candidate compound than in its absence, the candidate 
compound is identified as a stimulator of P2X4 mRNA or 
polypeptide expression. Alternatively, when expression of 
P2X4 mRNA or polypeptide is less (statistically signifi- 
cantly less) in the presence of the candidate compound than 
in its absence, the candidate compound is identified as an 
inhibitor of P2X4 mRNA or polypeptide expression. The 
level of P2X4 mRNA or polypeptide expression can be 
determined by methods described herein for detecting P2X4 
mRNA or polypeptide. 

[0170] P2X4 Binding Partners 

[0171] In another embodiment, binding partners that inter- 
act with a P2X4 molecule are detected. The P2X4 molecules 
can interact with one or more cellular or extracellular 
macromolecules, such as polypeptides, in vivo, and these 
molecules that interact with P2X4 molecules are referred to 
herein as "binding partners." Molecides that disrupt such 
interactions can be useful in regulating the activity of the 
target gene product. Such molecules can include, but are not 
Limited to molecules such as antibodies, peptides, and smaU 
molecules. The preferred target genes/products for use in 
this embodiment are the P2X4 genes herein identified. In an 
alternative embodiment, the invention provides methods for 
determining the ability of the test compound to modulate the 
activity of a P2X4 polypeptide through modulation of the 
activity of a downstream effector of a P2X4 target molecule. 
For example, the activity of the effector molecule on an 
appropriate target can be determined, or the binding of the 
effector to an appropriate target can be determined, as 
previously described. 


[0172] To identify compounds that interfere with the inter- 
action between the target gene product and its cellular or 
extracellular binding partner(s), e.g., a substrate, a reaction 
mixture containing the target gene product and the binding 
partner is prepared, under conditions and for a time suffi- 
cient, to allow the two products to form complex. In order 
to test an inhibitory agent, the reaction mixture is provided 
in the presence and absence of the test compoimd. The test 
compound can be initially included in the reaction mixture, 
or can be added at a time subsequent to the addition of the 
target gene and its cellular or extracellular binding partner. 
Control reaction mixtures are incubated without the test 
compound or with a placebo. The formation of any com- 
plexes between the target gene product and the cellular or 
extracellular binding partner is then detected. The formation 
of a complex in the control reaction, but not in the reaction 
mixture containing the test compound, indicates that the 
compound interferes with the interaction of the target gene 
product and the interactive binding partner. Additionally, 
complex formation within reaction mixtures containing the 
test compound and normal target gene product can also be 
compared to complex formation within reaction mixtures 
containing the test compound and mutant target gene prod- 
uct. TTiis comparison can be important in those cases 
wherein it is desirable to identify compounds that disrupt 
interactions of mutant but not normal target gene products. 

[0173] These assays can be conducted in a heterogeneous 
or homogeneous format. Heterogeneous assays involve 
anchoring either the target gene product or the binding 
partner onto a solid phase, and detecting complexes 
anchored on the solid phase at the end of the reaction. In 
homogeneous assays, the entire reaction is carried out in a 
liquid phase. In either approach, the order of addition of 
reactants can be varied to obtain difi'erent information about 
the compounds being tested. For example, test compounds 
that interfere with the interaction between the target gene 
products and the binding partoers, e.g., by coropetition, can 
be identified by conducting the reaction in the presence of 
the test substance. Alternatively, test compounds that dismpt 
preformed complexes, e.g., compounds with higher binding 
constants that displace one of the components from the 
complex, can be tested by adding the test compound to the 
reaction mixture after complexes have been formed. The 
various formats are briefly described below. 

[0174] In a heterogeneous assay system, either the target 

gene product or the interactive cellular or extracellular 
binding partner, is anchored onto a solid surface (e.g., a 
microtiter plate), while the non-anchored species is labeled, 
either directly or indirectly. The anchored species can be 
immobilized by non-covalent or covalent attachments. 
Alternatively, an immobilized antibody specific for the spe- 
cies to be anchored can be used to anchor the i^ecies to the 
solid surface. 

[0175] In order to conduct the assay, the partner of the 

immobilized species is exposed to the coated surface with or 
without the test compound. After the reaction is complete, 
unreacted components are removed (e.g., by washing) and 
any complexes formed will remain immobilized on the soUd 
surface. Where the non-immobilized species is pre-labeled, 
the detection of label immobilized on the surface indicates 
that complexes were formed. Where the non-immobilized 
species is not pre-labeled, an indirect label can be used to 
detect complexes anchored on the surface; e.g., using a 
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labeled antibody specific for the initially non-immobilized 
species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). 
Depending upon the order of addition of reaction compo- 
nents, test compounds that inhibit complex formation or that 
disrupt preformed complexes can be detected. 

[0176] Alternatively, the reaction can be conducted in a 
liquid phase in the presence or absence of the test com- 
pound, the reaction products separated firom unreacted com- 
ponents, and complexes detected; e.g., using an immobilized 
antibody specific for one of the binding components to 
anchor any complexes formed in solution, and a labeled 
antibody specific for the other partner to detect anchored 
complexes. Again, depending upon the order of addition of 
reactants to the liquid phase, test compounds that inhibit 
complex or that disrupt preformed complexes can be iden- 
tified. 

[0177] In an alternate embodiment of the invention, a 
homogeneous assay can be used. For example, a preformed 
complex of the target gene product and the interactive 
cellular or extracellular binding partner product is prepared 
in that either the target gene products or their binding 
partners are labeled, but the signal generated by the label is 
quenched due to complex formation (see, e.g., U.S. Pat. No. 
4,109,496 that utilizes this approach for immunoassays). 
The addition of a test substance that competes with and 
displaces one of the species from the preformed complex 
will result in the generation of a signal above back^ound. 
In this way, test substances that disrupt target gene product- 
binding partner interaction can be identified. 

[0178] Also, binding partners of P2X4 molecules can be 
identified in a two-hybrid assay or three-hybrid assay (see, 
e.g., U.S. Pat. No. 5^3^17; Zervos et al., Ce« 72:223-232 
(1993); Madura et al., J. Biol. Chem. 268: 12046-12054 
(1993); Bartel et al., Biotechniques 14: 920-924 (1993); 
Iwabuchi et al.. Oncogene 8: 1693-1696 (1993); and Brent 
WO94/10300), to identify other polypeptides, which bind to 
or interact with P2X4 ("P2X4-binding pol3^eptides" or 
"P2X4-bp") and are involved in P2X4 activity. Such P2X4- 
bps can be activators or inhibitors of signals by the P2X4 
polypeptides or P2X4 targets as, for example, downstream 
elements of a P2X4-mediated signahng pathway. 

[0179] A two-hybrid system is based on the modular 
nature of most transcription factors, which consist of sepa- 
rable DNA-binding and activation domains. Briefly, the 
assay utilizes two diflE^erent DNA constructs. In one con- 
struct, the gene that codes for a P2X4 polypeptide is fused 
to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a 
DNA sequence, from a library of DNA sequences, that 
encodes an unidentified polypeptide ("prey" or "Isample") is 
fused to a gene that codes for the activation domain of the 
known transcription factor. (Alternatively tiie: P2X4 
polypeptide can be the fiised to the activator domain.) If the 
"bait" and the "prey" polypeptides are able to interact, in 
vivo, forming a P2X4-dependent complex, the DNA-bind- 
ing and activation domains of the transcription factor are 
brought into close proximity. This proximity allows tran- 
scription of a reporter gene (e.g., LacZ) which is operably 
linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be 
detected and ceU colonies containing the functional tran- 


scription factor can be isolated and used to obtain the cloned 
gene which encodes the polypeptide which interacts with the 
P2X4 polypeptide. 

[0180] Identification of Candidate Therapeutics 

[0181] Candidate therapeutics for reducing fat deposition 
or treating NIDDM are identified from a group of test 
molecules that interact with a P2X4 nucleic acid or polypep- 
tide. Test molecules often are ranked according to the degree 
with which they interact or modulate (e.g., agonize or 
antagonize) DNA replication and/or processing, RNA tran- 
scription and/or processing, polypeptide production and/or 
processing, and/or function of P2X4 moleciiles, for 
example, and then top ranking modulators are selected. 
Also, pharmacogenomic information described herein can 
determine the rank of a modulator. Candidate therapeutics 
typically are formulated for administration to a subject. 

[0182] Therapeutic Treatments 

[0183] Formulations or pharmaceutical compositions typi- 
cally include in combination with a pharmaceuticaUy 
acceptable carrier a compound, an antisense nucleic acid, an 
siRNA molecule capable of inhibiting the expression of 
P2X4 or, optionally, any of its transcripts, a ribozyme, an 
antibody, or a binding partner that interacts with a P2X4 
polypeptide, a P2X4 nucleic acid, or a fragment thereof. The 
formulated molecule that is formulated may be one that is 
identified by a screening method described above. Also, 
formulations may comprise a P2X4 polypeptide, P2X4 
nucleic acid, or fragment thereof and a pharmaceuticaUy 
acceptable carrier. As used herein, the term "pharmaceuti- 
caUy acceptable carrier" includes solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic 
and absorption delaying agents, and the like, compatible 
with pharmaceutical administration. Supplementary active 
compounds can also be incorporated into the compositions. 

[0184] A pharmaceutical composition is formulated to be 
compatible with its intended route of administration. 
Examples of routes of administration include parenteral, 
e.g., intravenous, intradermal, subcutaneous, oral (e.g., inha- 
lation), transdermal (topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene glycols, 
glycerin, propylene glycol or other S3mthetic solvents; anti- 
bacterial agents such as benzyl alcohol or methyl parabens; 
antioxidaats such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; 
buffers such as acetates, citrates or phosphates and agents for 
the adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in anipoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

[0185] Pharmaceutical compositions suitable for inject- 
able use include sterile aqueous solutions (where water 
soluble) or dispersions and sterile powders for the extem- 
poraneouis preparation of sterile injectable solutions or dis- 
persion. For intravenous administration, suitable carriers 
include physiological saline, bacteriostatic water, Cremo- 
phor EL'" (BASF, Parsippany, N.J.) or phosphate buffered 
saline (PBS). In all cases, the composition must be sterile 
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and should be fluid to the extent that easy syringability 
exists. It should be stable under the conditions of manufac- 
ture and storage and must be preserved against the contami- 
nating action of microorganisms such as bacteria and fungi. 
The carrier can be a solvent or dispersion medium contain- 
ing, for example, water, ethanol, polyol (for example, glyc- 
erol, propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such 
as lecithin, by the maintenance of the required particle size 
in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved 
by various antibacterial and antifungal agents, for example, 
parabens, chlorobutanoL, phenol, ascorbic acid, tlmnerosal, 
and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as 
manitol, sorbitol, sodium chloride in the composition. Pro- 
longed absorption of the injectable compositions can be 
brought about by including in the composition an agent 
which delays absorption, for example, aluminum 
monostearate and gelatin. 

[0186] Sterile injectable solutions can be prepared by 
incoiporating ttie active compound in the required amount in 
an appropriate solvent with one or a combination of ingre- 
dients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incor- 
porating the active compotmd into a sterile vehicle which 
contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuima 
drying and freeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a 
previously sterile -filtered solution thereof. 

[0187] Oral compositions generally include an inert dilu- 
ent or an edible carrier. For the purpose of oral therapeutic 
administration, the active compound can be incorporated 

with excipients and used in the form of tablets, troches, or 
capsules, e.g., gelatin capsules. Oral compositions can also 
be prepared using a fluid carrier for use as a mouthwash. 
Pharmaceutically compatible binding agents, and/or adju- 
vant materials can be included as part of the composition. 
The tablets, piUs, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar 
nature: a binder such as microcrystalline cellulose, gum 
tragacanth or gelatin; an excipient such as starch or lactose, 
a disintegrating agent such as algiaic acid, Primogel, or corn 
starch; a lubricant such as magnesium stearate or Sterotes; a 
glidant sudi as colloidal silicon dioxide; a sweetening agent 
such as sucrose or sacdiarin; or a flavoring agent such as 
peppermint, methyl salicylate, or orai^e flavoring. 

[0188] For administration by inhalation, the compounds 
are dehvered in the form of an aerosol spray from pressured 
container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

[0189] Systemic administration can also be by transmu- 
cosal or transdermal means. For transmucosal or transder- 
mal administration, penetrants appropriate to the barrier to 
be permeated are used in the formulation. Such penetrants 
are generaUy known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be 


accomplLshed through the use of nasal sprays or supposito- 
ries. For transdermal administration, the active compounds 
are formulated into ointments, salves, gels, or creams as 
generally known in the art. Molecules can also be prepared 
in the form of suppositories (e.g., with conventional sup- 
pository bases such as cocoa butter and other glycerides) or 
retention enemas for rectal dehvery. 

[0190] In one embodiment, active molecules are prepared 
with carriers that will protect the compound against rapid 
elimination from the body, such as a controUed release 
formulation, including implants and microencapsulated 
delivery systems. Biodegradable, biocompatible polymers 
can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, coUagen, polyorthoesters, and polylactic 
acid. Methods for preparation of such formulations will be 
apparent to those skilled in the art. Materials can also be 
obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal anti- 
bodies to viral antigens) can also be used as pharmaceuti- 
cally acceptable carriers. These can be prepared according to 
methods known to those skilled in the art, for example, as 
described in U.S. Pat. No. 4,522,811. 

[0191] It is advantageous to formulate oral or parenteral 
compositions in dosage unit form for ease of administration 
and uniformity of dosage. Dosage unit form as used herein 
refers to physically discrete units suited as unitary dosages 
for the subject to be treated; each unit containing a prede- 
termined quantity of active compound calculated to produce 
the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[0192] Toxicity and therapeutic efficacy of such com- 
pounds can be determined by standard pharmaceutical pro- 
cedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the popu- 
lation) and the ED50 (the dose therapeutically efEective in 
50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD5O/ED50. Molecules which exhibit 
high therapeutic indices are preferred. While molecules that 
exhibit toxic side effects may be used, care should be taken 
to design a delivery system that targets such compounds to 
the site of affected tissue in order to minimize potential 
damage to uninfected cells and, thereby, reduce side effects. 
[0193] The data obtained from the cell culture assays and 
animal studies can be used in formulating a range of dosage 
for use in humans. The dosage of such molecules lies 
preferably within a range of circulating concentrations that 
include the ED50 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any 
molecules used in the method of the invention, the thera- 
peutically effective dose can be estimated initially from cell 
culture assays. A dose may be formulated in animal models 
to achieve a circulating plasma concentration range that 
includes the IC50 (i.e., the concentration of the test com- 
pound which achieves a half-maximal inhibition of symp- 
toms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. 
Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

[0194] As defined herein, a therapeutically effective 
amount of protein or polypeptide (i.e., an effective dosage) 


us 2004/0018533 Al 


26 


Jan. 29, 2004 


ranges from about 0.001 to 30 mg/kg body weight, some- 
times about 0.01 to 25 mg/kg body weight, often about 0.1 
to 20 mg/kg body weight, and more often about 1 to 10 
mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 
mg/kg body weight. The protein or polypeptide can be 
administered one time per week for between about 1 to 10 
weeks, sometimes between 2 to 8 weeks, often between 
about 3 to 7 weeks, and more often for about 4, 5, or 6 
weeks. The skilled artisan will appreciate that certain factors 
may influence the dosage and timing required to effectively 
treat a subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. More- 
over, treatment of a subject with a therapeutically effective 
amoimt of a protein, polypeptide, or antibody can include a 
single treatment or, preferably, can include a series of 
treatments. 

[0195] With regard to polypeptide formulations, featured 
herein is a method for reducing fat deposition, alleviating 
obesity and/or alleviating NIDDM in a subject, which com- 
prises contacting a P2X4 protein with one or more cells of 
a subject in need thereof, wherein the P2X4 protein is 
encoded by a P2X4 nucleotide sequence which comprises a 
polynucleotide sequence selected from the group consisting 
of (a) the polynucleotide sequence of SEQ ID NO: 1; (b) a 
polynucleotide sequence which encodes a polypeptide con- 
sisting of the amino acid sequence of SEQ ID N0:2 or SEQ 
ID NO:3; (c) a polynucleotide sequence which encodes a 
polypeptide that is 90% identical to the amino acid sequence 
of SEQ ID NO:2 or SEQ ID NO:3 or a polynucleotide 
sequence 90% identical to the nucleotide sequence of SEQ 
ID NO;l; and (d) a fragment of one of the foregoing 
polynucleotide sequences, where contacting the one or more 
cells of the subject with the P2X4 protein reduces fat 
deposition, alleviates obesity and/or alleviates NIDDM. The 
P2X4 protein often is administered to a subject prognosed as 
being at risk of fat deposition, obesity and/or NIDDM or is 
diagnosed as having obesity or NIDDM before the protein is 
administered in vivo (e.g., injected into the subject), ex vivo 
(e.g., cells from the subject are contacted with the protein in 
a petri dish and the contacted cells then are returned to the 
subject), or in vitro (e.g., cells from the subject are contacted 
with the protein in a petri dish to observe the effect of the 
protein on the ceUs). The subject often is a human. 

[0196] For antibodies, a dosage of 0,1 mg/kg of body 
weight (generally 10 mg/kg to 20 mg/kg) often is utilized. If 
the antibody is to act in the brain, a dosage of 50 mg/kg to 
100 mg/kg often is appropriate. Generally, partially human 
antibodies and fully human antibodies have a longer half-life 
within the human body than other antibodies. Accordingly, 
lower dosages and less frequent administration often is 
possible. Modifications such as hpidation can be used to 
stabilize antibodies and to enhance uptake and tissue pen- 
etration (e.g., into the brain). A method for hpidation of 
antibodies is described by Cruikshank et al., J. Acquired 
Immune Deficiency Syndromes and Human Retrovirology 
14:193 (1997). 

[0197] Antibody conjugates can be used for modifying a 
given biological response, the drug moiety is not to be 
construed as limited to classical chemical therapeutic agents. 

For example, the drug moiety may be a protein or polypep- 
tide possessing a desired biological activity. Such proteins 
may include, for example, a toxin such as abrin, ricin A, 


pseudomonas exotoxin, or diphtheria toxin; a polypeptide 
such as tumor necrosis factor, .alpha.-interferon, .beta.- 

interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response 
modifiers such as, for example, lympholdnes, interleukin-1 
("IL-1 "), interleukin-2 C'IL-2"), interleukin-6 ("IL-6"), 
granulocyte macrophage colony stimulating factor ("GM- 
CSF"), granulocyte colony stimulating factor ("G-CSF"), or 
other growth factors. Alternatively, an antibody can be 
conjugated to a second antibody to form an antibody het- 
erooonjugate as described by Segal in U.S. Pat. No. 4,676, 
980. 

[0198] For compounds, exemplary doses include milli- 
gram or microgram amounts of the compound per kilogram 
of subject or sample weight, for example, about 1 micro- 
gram per kQogram to about 500 milligrams per kilogram, 
about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 
micrograms per kdogram. It is understood that appropriate 
doses of a small molecule depend upon the potency of the 
smaU molecule with respect to the expression or activity to 
be modulated. When one or more of these small molecules 
is to be administered to an animal (e.g., a human) in order 
to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or 
reseaicher may, for example, prescribe a relatively low dose 
at first, subsequently increasing the dose until an appropriate 
response is obtained. In addition, it is understood that the 
specific dose level for any particular animal subject will 
depend upon a variety of factors including the activity of the 
specific compound employed, the age, body weight, general 
health, gender, and diet of the subject, the time of admin- 
istration, the route of administration, the rate of excretion, 
any drug combination, and the degree of expression or 
activity to be modulated. Candidate therapeutic molecules 
identified in screening methods described previously some- 
times are administered to cells of a subject in vivo, ex vivo, 
or in vitro to reduce fat deposition, alleviate obesity and/or 
alleviate NIDDM in the subject, and the subject often is 
prognosed or diagnosed as having such conditions before the 
candidate therapeutic is contacted with the cells. 

[0199] P2X4 nucleic acid molecules can be inserted into 
vectors and used in gene therapy methods for reducing fat 
deposition or treating NIDDM. Featured herein is a method 
for reducing fat deposition, alleviating obesity and/or alle- 
viating NIDDM in a subject, which comprises contacting a 
P2X4 nucleic acid with one or more cells of a subject in need 
thereof, wherein the P2X4 nucleic acid comprises a poly- 
nucleotide sequence selected from the group consisting of 
(a) the polynucleotide sequence of SEQ ID NO:l; (b) a 
polynucleotide sequence which encodes a polypeptide con- 
sisting of the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO:3; (c) a polynucleotide sequence which encodes a 
polypeptide that is 90% identical to the amino acid sequence 
of SEQ ID NO:2 or SEQ ID NO: 3 or a polynucleotide 
sequence 90% identical to the nucleotide sequence of SEQ 
ID NO:l; and (d) a fragment of one of the foregoing 
polynucleotide sequences, where contacting the one or more 
cells of the subject with the P2X4 protein reduces fat 
deposition, alleviates obesity and/or alleviates NIDDM. The 
P2X4 nucleic acid often is administered to a subject prog- 
nosed as being at risk of fat deposition, obesity and/or 
NIDDM or is diagnosed as having obesity or NIDDM before 
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the nucleic acid is admiaistered in vivo, ex vivo, or in vitro. 
The subject often is a human. 

[0200] Gene therapy vectors can be delivered to a subject 
by, for example, intravenous injection, local administration 
(see U.S. Pat. No. 5,328,470) or by stereotactic injection 
(see e.g., Oien et al., (1994) Proc. Natl Acad. Sci. USA 
91:3054-3057). Fhaimaceutical preparations of gene 
therapy vectors can include a gene therapy vector in an 
acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced 
intact from recombinant cells (e.g., retroviral vectors) the 
pharmaceutical preparation can include one or more cells 
which produce the gene deUvery system. Examples of gene 
delivery vectors are described herein. 

[0201] Pharmaceutical compositions can be included in a 
container, pack, or dispenser together with instructions for 
administration. Pharmaceutical compositions of active 
ingredients can be administered by any of the paths 
described herein for therapeutic and prophylactic methods 
for reducing fat deposition or treating NIDDM. With regard 
to both prophylactic and therapeutic methods of treatment, 
such treatments may be specifically tailored or modified, 
based on knowledge obtained from pharmacogenomic 
analyses described herein. As used herein, the term "treat- 
ment" is defined as the application or administration of a 
therapeutic agent to a patient, or application or administra- 
tion of a therapeutic agent to an isolated tissue or cell line 
from a patient, who has a disease, a symptom of disease or 
a predisposition toward a disease, with the purpose to cure, 
heal, alleviate, relieve, alter, remedy, ameliorate, improve or 
affect the disease, the symptoms of disease or the predispo- 
sition toward disease. Atherapeutic agent includes, but is not 
limited to, small molecules, peptides, antibodies, ribozymes 
and antisense oligonucleotides. 

[0202] A prophylactic agent often is administered prior to 
the manifestation of symptoms characteristic of the P2X4 
aberrance, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending on the 
type of P2X4 aberrance, for example, a P2X4 molecule, 
P2X4 agonist, or P2X4 antagonist agent can be used for 
treating the subject. The appropriate agent often is deter- 
mined based on screening assa)^ described herein. 

[0203] As discussed, successful treatment of P2X4 disor- 
ders can be brought about by techniques that serve to inhibit 
the expression or activity of target gene products. For 
example, compounds (e.g., an agent identified using an 
assays described above) that exhibit negative modulatory 
activity can be used in accordance with the invention to 
prevent and/or ameliorate fat deposition or NIDDM. Such 
molecules can include but are not limited to peptides, 
phosphopeptides, small organic or inorganic molecules, 
siRNA molecules or antibodies (including, for example, 
polyclonal, monoclonal, humanized, anti-idiotypic, chi- 
meric or single chain antibodies, and Fab, F(ab')2 and Fab 
expression library fragments, scFV molecules, and ^itope- 
binding fragments thereof). 

[0204] Further, antisense and ribozyme molecules that 
inhibit expression of the target gene can also be used in 

accordance with the invention to reduce the level of target 
gene expression, thus effectively reducing the level of target 
gene activity. Still further, triple hehx molecules can be 


utOized in reducing the level of target gene activity. Anti- 
sense, ribozyme and triple helix molecules are discussed 

above. 

[0205] It is possible that the use of antisense, ribozyme, 
and/or triple heKx molecules to reduce or inhibit mutant 
gene expression can also reduce or inhibit the transcription 
(triple helix) and/or translation (antisense, ribozyme) of 
mRNA produced by normal target gene alleles, such that the 
concentration of normal target gene product present can be 
lower than is necessary for a normal phenotype. In such 
cases, nucleic acid molecules that encode and egress target 
gene polypeptides exhibiting normal target gene activity can 
be introduced into cells via gene therapy method. Alterna- 
tively, in instances in that the target gene encodes an 
extracellular polypeptide, it can be preferable to co -admin- 
ister normal target gene polypeptide into the cell or tissue in 
order to maintain the requisite level of cellular or tissue 
target gene activity. 

[0206] Another method by which nucleic acid molecules 
may be utilized in treating or preventing a disease charac- 
terized by P2X4 expression is through the use of aptamer 
molecules specific for P2X4 polypeptide. Aptamers are 
nucleic acid molecules having a tertiary structure which 
permits them to specifically bind to polypeptide ligands (see, 
e.g., Osborne, et al., Curr. Opin. Chem. Biol. 1(1): 5-9 
(1997); and Patel, D. J., Curr. Opin. Chem. BioL Jun; 1(1): 
32-46 (1997)). Since nucleic acid molecules may in many 
cases be more conveniently introduced into target cells than 
therapeutic polypeptide molecules may be, aptamers offer a 
method by which P2X4 polypeptide activity may be spe- 
cifically decreased without the introduction of drugs or other 
molecules which may have pluripotent effects. 

[0207] Antibodies can be generated that are both specific 
for target gene product and that reduce target gene product 
activity. Such antibodies may, therefore, by admiaistered in 
instances whereby negative modulatory techniques are 
appropriate for the treatment of P2X4 disorders. For a 
description of antibodies, see the Antibody section above. 

[0208] In circumstances where injection of an animal or a 
human subject with a P2X4 polypeptide or epitope for 
stimulating antibody production is harmful to the subject, it 
is possible to generate an immune response against P2X4 
through the use of anti-idiotypic antibodies (see, for 
example, Herlyn, D„ Ann. Aferf.;31(l): 66-78 (1999); and 
Bhattacharya-Chatterjee, M., and Foon, K. A., Cancer Treat. 
Res.; 94: 51-68 (1998)). If an anti-idiotypic antibody is 
introduced into a mammal or human subject, it should 
stimulate the production of anti-anti-idiotypic antibodies, 
which should be specific to the P2X4 polypeptide. Vaccines 
directed to a disease characterized by P2X4 expression may 
also be generated in this fashion. 

[0209] In instances where the target antigen is intracellular 
and whole antibodies are used, internalizing antibodies may 
be preferred. Lipofectin or liposomes can be used to dehver 
the antibody or a fragment of the Fab region that binds to the 
target antigen into cells. Where fragments of the antibody 
are used, the smallest inhibitory fragment that binds to the 
target antigen is preferred. For example, peptides having an 
amino acid sequence corresponding to the Fv region of the 
antibody citn be used. Alternatively, single chain neutraliz- 
ing antibodies that bind to intracellular target antigens can 
also be administered. Such single chain antibodies can be 
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administered, for example, by expressing nucleotide 
sequences encoding single -chain antibodies within the target 
cell population (see e.g., Marasco et aL, Proc. NatL Acad. 
Sci. USA 90: 7889-7893 (1993)). 

[0210] P2X4 molecules and compounds that inhibit target 
gene expression, synthesis and/or activity can be adminis- 
tered to a patient at therapeutically effective doses to pre- 
vent, treat or ameliorate P2X4 disorders. A therapeutically 
effective dose refers to that amount of the compound suffi- 
cient to result in amelioration of symptoms of the disorders. 

[0211] Toxicity and therapeutic efBcacy of such com- 
pounds can be determined by standard pharmaceutical pro- 
cedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the popu- 
lation) and the ED^g (the dose therapeutically effective in 
50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD5q/ED5o. Compounds that exhibit 
large therapeutic indices are preferred. While compounds 
that exhibit toxic side effects can be used, care should be 
taken to design a delivery system that targets such com- 
pounds to the site of affected tissue in order to minimize 
potential damage to uninfected cells and, thereby, reduce 
side effects. 

[0212] Data obtained from cell culture assays and animal 
studies can be used in formulating a range of dosage for use 
in humans. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the 
ED50 with little or no toxicity. The dosage can vary within 
this range depending upon the dosage form employed and 
the route of administration utilized. For any compound used 
in the method of the invention, the therapeutically effective 
dose can be estimated initially from cell culture assays. A 
dose can be formulated in animal models to achieve a 
circulating plasma concentration range that includes the IC50 
(i.e., the concentration of the test compound that achieves a 
half-maximal inhibition of symptoms) as determined in cell 
culture. Such information can be used to more accurately 
determine useful doses in humans. Levels in plasma can be 
measured, for example, by high performance liquid chro- 
matography. 

[0213] Another example of effective dose determination 
for an individual is the ability to directly assay levels of 
"free" and "bound" compound in the serum of the test 
subject. Such assays may utilize antibody mimics and/or 
"biosensors" that have been created through molecular 
imprinting techniques. The compound which is able to 
modulate P2X4 activity is used as a template, or "imprinting 
molecule", to spatially organize polymerizable monomers 
prior to their polymerization with catalytic reagents. The 
subsequent removal of the imprinted molecule leaves a 
polymer matrix which contains a repeated "negative image" 
of the compound and is able to selectively rebind the 
molecule under biological assay conditions. A detailed 
review of this technique can be seen in Ansell, R. J. et al., 
Current Opinion in Biotechnology 7: 89-94 (1996) and in 
Shea, K. J., Trends in Polymer Science 2: 166-173 (1994). 
Such "imprinted" aflBnity matrixes are amenable to Ugand- 
binding assays, whereby the immobilized monoclonal anti- 
body component is replaced by an appropriately imprinted 
matrix. An example of the use of such matrixes in thLs way 
can be seen in Vlatakis, G. et al.. Nature 361. 645-647 


(1993). Through the use of isotope -labeling, the "free" 
concentration of compoimd which modulates the expression 
or activity of P2X4 can be readily monitored and used in 
calculations of IC^q. Such "imprinted" affinity matrixes can 
also be designed to include fluorescent groups whose pho- 
ton-emitting properties measurably change upon local and 
selective binding of target compound. These changes can be 
readily assayed in real time using appropriate fiberoptic 
devices, in turn allowing the dose in a test subject to be 
qmckly optimized based on its individual IC5Q. A rudimen- 
tary example of such a "biosensor" is discussed in Kriz, D. 
et al.. Analytical Chemistry 67: 2142-2144 (1995). 

[0214] Provided herein are methods of modulating P2X4 
expression or activity for therapeutic purposes. Accordingly, 
in an exemplary embodiment, the modulatory method of the 
invention involves contacting a cell with a P2X4 or agent 
that modulates one or more of the activities of P2X4 
polypeptide activity associated with the cell. An agent that 
modulates P2X4 polypeptide activity can be an agent as 
described herein, such as a nucleic acid or a polypeptide, a 
naturally-occurring target molecule of a P2X4 polypeptide 
(e.g., a P2X4 substrate or receptor), a P2X4 antibody, a 
P2X4 agonist or antagonist, a peptidomimetic of a P2X4 
agonist or antagonist, or otiier smaU molectile. 

[0215] In one embodiment, the agent stimulates one or 
more P2X4 activities. Examples of such stimulatory agents 
include active P2X4 polypeptide and a nucleic acid mol- 
ecule encoding P2X4. In another embodiment, the agent 
inhibits one or more P2X4 activities. Examples of such 
inhibitory agents include antisense P2X4 nucleic acid mol- 
ecules, anti-P2X4 antibodies, and P2X4 inhibitors. These 
modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo 
(e.g., by administering the agent to a subject). As such, the 
present invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant 
or unwanted expression or activity of a P2X4 polypeptide or 
nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by 
a screening assay described herein), or combination of 
agents that modulates (e.g., upregulates or downregulates) 
P2X4 expression or activity. In another embodiment, the 
method involves administering a P2X4 polypeptide or 
nucleic acid molecule as therapy to compensate for reduced, 
aberrant, or unwanted P2X4 expression or activity. 

[0216] Stimulation of P2X4 activity is desirable in situa- 
tions in which P2X4 is abnormally downregulated and/or in 
which increased P2X4 activity is likely to have a beneficial 
effect. For example, stimulation of P2X4 activity is desirable 
in situations in which a F2X4 is downregulated and/or in 
which increased P2X4 activity is likely to have a beneficial 
effect. Likewise, inhibition of P2X4 activity is desirable in 
situations in which P2X4 is abnormally upregulated and/or 
in which decreased P2X4 activity is likely to have a ben- 
eficial effect. 

[0217] With regard to treatment of NIDDM, featured 
herein are methods of causing or inducing a desired bio- 
logical response in an individual comprising the steps of: 
providing or administering to an individual a composition 

comprising a P2X4 nucleic acid, P2X4 protein, anti-P2X4 
antibody, P2X4 interacting molecule or another therapeutic 
composition described herein, where the biological response 
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is selected from the group consisting of: (a) modulating 
circulating (e.g., blood, serum or plasma) levels (e.g., con- 
centration) of glucose (e.g., lowering glucose levels); (b) 
increasing cell or tissue sensitivity to insulin, particularly 
muscle, adipose, liver or brain; (c) inhibiting the progression 
from impaired glucose tolerance to insulin resistance; (d) 
increasing glucose uptake in skeletal muscle cells; (e) 
increasing glucose uptake in adipose ceUs; (f) increasing 
glucose uptake in neuronal cells; (g) increasing glucose 
uptake in red blood cells; (h) increasing glucose uptake in 
the brain; and (i) significantly reducing the postprandial 
increase in plasma glucose following a meal, particularly a 
high carbohydrate meal (e.g., about 2- fold, about 10-fold 
and about 50-fold reductions). 

[0218] In further embodiments, a pharmaceutical or physi- 
ologically acceptable composition is used as an insuHn 
sensitiser. In other embodiments, a pharmaceutical or physi- 
ologically acceptable composition is used in a method to 
improve insuUn sensitivity in subjects having NIDDM in 
combination with insulin therapy, or in an alternative 
embodiment, without insulin therapy. In other embodiments, 
a pharmaceutical or physiologically acceptable composition 
described herein is utilized in a method of treating individu- 
als with gestational diabetes. "Gestational diabetes" refers to 
the development of diabetes in an individual during preg- 
nancy, usually during the second or third trimester of preg- 
nancy. In further embodiments, a pharmaceutical or physi- 
ologically acceptable composition described herein is used 
in a method for treating individuals with impaired fasting 
glucose (IFG). "Impaired fasting glucose" (IFG) is a con- 
dition in which fasting plasma glucose levels in an indi- 
vidual are elevated but not diagnostic of overt diabetes, i.e. 
plasma glucose levels of less than 126 mg/dl and greater 
than or equal to 110 mg/dl. In other embodiments, a phar- 
maceutical or physiologically acceptable composition 
described herein is used in a method for treating and 
preventing impaired glucose tolerance (IGT) in an indi- 
vidual. By providing therapeutics and methods for reducing 
or preventing IGT (i.e., for normalizing insuhn resistance), 
the progression to NIDDM can be delayed or prevented. 
Furthermore, by providing therapeutics and methods for 
reducing or preventing Insulin resistance, provided are 
methods for reducing and/or preventing the appearance of 
insulin-resistance syndrome (IRS). 

[0219] In further embodiments, a pharmaceutical or physi- 
ologically acceptable composition described herein is used 
in a method for treating a subject having polycystic ovary 
syndrome (PCOS). PCOS is among the most common 
disorders of pre-menopausal women, affecting 5-10% of this 
population. Insulin-sensitizing agents (e.g., troglitazone) 
have been shown to be effective in PCOS and improve 
defects in insulin action, insuhn secretion, ovarian ste- 
roidogenosis and fibrinolysis (Ehrman et al. (1997) J Clin 
Invest 100:1230) in insulin-resistant humans. Accordingly, 
provided are methods for reducing insulin resistance, nor- 
malizing blood glucose thus treating and/or preventing 
PCOS. 

[0220] In other embodiments, a pharmaceutical or physi- 
ologically acceptable composition described herein is used 
in a method for treating a subject having insulin resistance. 
In further embodiments, a subject having insulin resistance 
is treated according to the methods described herein to 
reduce, alleviate, or cure the insulin resistance. As insulin 


resistance often is associated with infections and cancer, 
reducing, alleviating and/or preventing insulin resistance 
according to the methods of described herein can prevent or 
reduce infections and cancer. In another embodiment, meth- 
ods described herein are used to prevent the development of 
insulin resistance in a subject (e.g., those known to have an 
increased risk of developing Insulin resistance). 

[0221] Any of the above-described tests or other tests 
known in the art can be used to determine that a subject is 
insulin resistant, and a patient thus diagnosed then is treated 
according to the methods described herein to reduce or cure 
insulin resistance. Alternatively, methods described herein 
also can be used to prevent the development of insulin 
resistance in a subject, e.g., those known to have an 
increased risk of developing insulin-resistance. 

[0222] The examples set forth below are intended to 
illustrate but not limit the invention. 

EXAMPLES 

[0223] In the following studies a group of subjects were 
selected according to specific parameters relating to fat 
deposition. Nucleic acid samples obtained from individuals 
in the study group were subjected to genetic analysis, which 
identified associations between central obesity and certain 
polymorphic regions in the P2X4 gene on chromosome 12. 
Polymorphic variations identified as being associated with 
central obesity were further screened in subjects with 
NIDDM to determine if they are also associated with the 
development of diabetes. Methods are described for produc- 
ing P2X4 polypeptide and P2X4 polypeptide variants in 
vitro or in vivo, P2X4 nucleic acids or polypeptides and 
variants thereof are utilized for screening test molecules for 
those that interact with P2X4 molecules. Test molecules 
identified as interactors with P2X4 molecules and P2X4 
variants are further screened in vivo to determine whether 
they can reduce fat deposition or treat NIDDM. 

Example 1 

Sample Selection 

[0224] In addition to simple chnical measurements, dual 
x-ray absorbtiometry (DEXA) was utilized to determine fat 
content in subjects for the genetic analysis. Central fat was 
the primary target variable, and data were collected using a 
Hologic QDR 4500 DEXA system. The central region for 
central fat determinations was defined as the region extend- 
ing from the superior surface of the second lumbar vertebra 
extending inferiorly to the inferior surface of the fourth 
lixmbar vertebra and laterally to the inner aspect of the 
ribcage. The amount of central fat and percent central fat 
was automatically calculated by the equipment and down- 
loaded into a database. 

[0225] Waist and hip measurements were generated while 
subjects were wearing underclothes and standing with their 
arms by their sides. A tape measure was utilized for these 
measurements, and care was taken to ensure that the tape 
was resting on the skin and not tig^t. Waist circumference 
was measured to the nearest centimeter at the narrowest 
point between the ihac crest and the lower edge of the ribs. 
Hip circumference was measured to the nearest centimeter at 
the widest point below the iliac crest. 
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[0226] Sample selection was restricted to female twins 
followed by the St. Thomas Hospital in England and the 
Royal North Shore Hospital in Australia. It was estimated 
that 552 dizygotic sibling pairs would yield statistical results 
of reasonable power. A further 272 unrelated individuals 
selected from monozygotic twin pairs were added to the 
sample set to incarease the probability for detecting associa- 
tions and also for testing gene-environment interactions. The 
study group was selected from this combination of dizygotic 
and monozygotic sibling pairs, referred to as the "selection 
group." 

[0227] Central fat measurements and triglyceride mea- 
surements were chosen as primary target phenotypes and 
twin pairs were selected from the selection group for 
extreme discordance and concordance. Specifically, DEXA 
measuremeats and triglyceiide measurements (colorimetric 
enzymatic method: glycerol-3-phosphate-oxidase, peroxi- 
dase, PAP (Roche), CV%=2.6, reference range less than 2.5 
mmol/L) for each individual in the selection group were 
arranged in ascending order, and individuals in the top and 
lower tenth percentile were chosen from each distribution. A 
small subset of individuals falling in the middle range of 
each distribution was chosen as a control group. In addition 
to primary phemotype trait information, samples for inclu- 
sion in the study group were selected based on data coverage 
for the following secondary phenotypes recorded by each 
individual: BMI, insulin resistance, high density lipoprotein 
in serum, waist, lipoprotein(a) in serum, insulin, hip, and 
waist/hip ratio. 

[0228] Also, presence of diabetes, thyroid disease, and 
renal disease reported by each individual were primary 
criteria for excluding subjects from the study group. Also, 
insulin levels greater than 7.1 juV /ml (Micro particle Enzyme 
Immunoassay from Abbott Laboratories Diagnostics Divi- 
sion (jMU/ml=pmol/Lx7-175) and creatine levels greater than 
160 mmol/L (measured by Jaffe method: calorimetiic test in 
which creatine reacts with picric acid in an alkaline solution 
to form a yellow-red colored complex) were also used as 
exclusion criteria as they are indicative of these diseases. 
Further excluded were pairs discordant for menopausal 
status, twin pairs where one or both of the twins were taking 
lipid lowering medication, non-fasting subjects (less than 
eight hours eating), and twin pairs including subjects treated 
with beta-blockers, thiazide diuretics, or exogeneous estro- 
gen. 

[0229] Selecting among dizygotic and monozygotic twins 
for extreme discordance or concordance for the primary 
phenotypes minimized complications associated with bivari- 
ate ranking. After applying exclusion criteria, 253 monozy- 
gotic subjects were available for inclusion, which fell short 
of the target population of 276. In reaction to this situation, 
the exhreme 201 subjects were selected from the 253 sub- 
jects and the desired numbers were reached by adding 
monozygotic unrelated individuals with data for central fat 
only, and unrelated individuals from the dizygotic cohort 
with data for triglycerides only. Samples available for final 
selection for the 552 dizygotic pairs included 178 pairs 
extreme for both traits, 205 extreme for tr^lycerides only, 
and 208 for central fat only. 

[0230] In this test population, coverage for the secondary 
phenotypes ranged from 67% to 90%. In total, 61% of 
subjects had coverage for all primary and secondary phe- 


notypes. A broad age spectrtim was also represented, and 
niunbeis of pre-menopausal and post-menopausal subjects 
were relatively evenly distributed. 

Example 2 

Association of Polymorphic Variations to Fat 

Deposition 

[0231] Blood samples were taken from individuals in the 
study population described in Example 1. Genomic DNA 
was extracted from these blood samples using standard 
techniques (BACC2 DNA extraction kit (Nucleon Bio- 
sciences)) and subjected to analysis. Based upon a back- 
ground Unkage study and fine mapping analysis by micro- 
satelKte markers, it was postulated that genetic elements 
linked to central fat deposition were located on the 12q24 
region of chromosome twelve. One of the genes located in 
this region encoded P2X4. 

[0232] Whole genome linkage scans were performed for 
the purpose of identifying genomic regions likely to harbor 
genes with a major contribution to deposition of central fat. 
The linkage scans were performed using highly polymorphic 
microsatellite markers (Reed et al., Nat Genet 7:390-5 
(1994)) and DNA samples obtained from 1100 Caucasian 
female twin pairs from the UK. Samples selected for inclu- 
sion in the study cohort encompassed a broad spectrum of 
phenotypic trait values, ranging from lean to obese subjects. 
Initial studies were carried out using 400 commercially 
available microsatellite markers derived from the Genethon 
linkage map, with an average genomic spacing between 
markers of approximately 10 cM (ABI Prism linkage map- 
ping set, version 2 from PE Applied Biosystems). 

[0233] Multipoint nonparametric linkage analysis was 
performed using MAPMAKER/SIBS (Kniglyak & Lander, 
Amer. J. Human Genetics 57:439-454 (1995)). A bioinfor- 
matics infrastructure and software packages described in 
WO 00/51053 were used in the linkage study to record 
marker positions, store data and generate data files. Output 
from these systems was then used with relevant application 
software to perform the statistical analysis. 

[0234] Genotyping reactions were generally carried out in 
microtitre plates (384-well, reaction volume 5 //I), contain- 
ing 12.5 ng of DNA from study subjects was amplified using 
PGR and sequence specific ol:^onucleotide primers labeled 
with 6-FAM™, HEX™, or NED™ fluorescent dyes. PGR 
products were analyzed by electrophoresis in a polyacryla- 
mide denaturing gel, with an ABI PRISM^^ GENES CAN® 
400HD ROX labeled size standard in each lane on an ABI 
model 377 analyzer (Applied Biosystems, Foster City, 
Calif.). For genotyping, the chosen markers were divided 
into two groups (panels) so that the analysis of all of the 
maricers cotjld be performed in two electrophoresis runs of 
each sample. Consequently, there was no overlap of frag- 
ment sizes in any one dye for either of the panels. Genotype 
analysis was performed using ABI PRISM^" GENES- 
CAN® software (version 3.0), and genotyped manually 
using ABI PRISM™ Genotyper 2.0. Results were input into 
a database and binned by marker. The results were quality 
checked, ensuring consistent inheritance within families. 
Families that were found to have consistent pedigree prob- 
lems were excluded from the analysis set. 
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[Q235] The ordering of genetic mapping markers (i.e. 
microsatellite markers) was relatively stable in the region 
analyzed according to the Unified Data Base for Human 
Genome Mapping, Weizmann Institute of Science (UDB) 
and National Center for Biotechnology Information, 
National Institutes of Health (NCBI) assemblies durii^ the 
duration of the study. Conversion of genetic to physical 
positioas for strategic microsatellite markers was performed 
using UDB and NCBI as the reference standards. Compari- 
sons of the identity and positioning of genomic contigs in the 
region also were made between UDB and NCBI and pro- 
vided relatively good agreement. A comparison of the posi- 
tioning of aU identified and predicted genes within the 
region also was made between NCBI (build 22) and Joint 
Project between European Bioinformatics Institute and the 
Sanger Centre (ENSEMBL). 

[0236] Microsatellite marker analysis showed Unkage on 
the long arm of chromosome 12 to central fat deposition, 
percent central fat and total fat in the region spanning 125 
cM to 155 cM, with a peak non parametric Z score of 3.6 for 
central fat. The region was narrowed further to identify the 
chromosomal interval 12q24 as being the primary region 
harboring genes contributing to central fat deposition using 
the following higihly polymorphic microsatellite markers: 
D12S86, D12S1612, D12S1614, D12S340, D12S324, 
D12S1675, D12S1679, D12S1659 and D12S97. 

[0237] The chromosome 12q24 region then was analyzed 

using single nucleotide polymorphisms to identify genes in 
the region that regulate central fat deposition. Potential 
polymorphisms in the P2X4 polynucleotide were identified 
in a publicly available SNP database (see http address 
www.ncbi.nkn.nih.gov/SNP) and were verified in a group 
other than the study group. Polymorphisms verified as 
statistically significant SNPs (i.e., minor allele represented 


in more than 10% of the population) were genotyped in the 
study population to determine associations with fat deposi- 
tion. A procedure for detecting polymorphisms was utilized 
in the verification and genotyping studies, described here- 
after. Table 1 shows the majority of polymorphisms sub- 
jected to genotype analysis, where "np" indicates that the 
SNP is not present in the sequence of SEQ ID NO:l and "-" 
indicates that there is no reference SNP ID number. Refer- 
ence SNP number rsl044251 is now called rs25643, and 
corresponds to the same SNP described previously by 
rsl044251 (see http address www.ncbi.nlm.nih.gov/SNP). 


TABLE 1 



Position in 

Position in 



Reference 

SEQ ID 

Ensembl 

Position in NCBI 


SNP ID 

NO: 1 

Database 

Database 

Allelic ^^liabllity 

IS9301S7 

7337 

4577 

7336 

A/G 

isll 82945 

11030 

8270 

11029 

T/A 

1S1044251 

15847 

13087 

15846 

T/C 

(IS25643) 





IS725135 

17338 

14577 

17336 

T/C 

IB25644 

21708 

19093 

21699 

A/G 

IS1151880 

22713 

2009S 

22704 

C/T 


14744 

11984 

11744 

C/T 


[0238] Assays for Verifying and Genotyping SNPs 

[0239] An assay utiUzed for determining whether a poly- 
morphic variation was present in a nucleic acid sample 
involved a sequencing by synthesis procedure. DNA poly- 
merase, ATP sulfiirylase, hidferase, apyrase, luciferin, and 
adenosine 5'-phosphosulfate (APS) were required, and in the 
process, one dNTP was added to an extension oligonucle- 
otide at a time and then degraded if not incoroporated in the 
synthesized strand. Incorporation of a dNTP to the end of the 
extension oligonucleotide was detected by light emission. 

[0240] The assay was carried out by first amplifying a 
region of interest in the sample by using a polymerase chain 
reaction (PCR) that incorporated the primers set forth in 
Table 2. 

TABLE 2 


SEQ SEQ 
ID ID 
no : Reverse PCR primer NO : 

GGTTTCATGCCACTCATTATCA 

CCCTTCCTCCTGACTTTTATG 

AAAACAAAACA2UUU3GCACTTC 

CCAAGCTAGAGTGCAGAAGAG 

AGGGCTGG6TCAGGA6AS 

GCCCTAAATTCAGTGACTGATAT 


[0241] A typical PCR reaction included 14.24 fA of water, 
2.23 fd of PCR buffer, 1.38 f*l of 1.5 mM MgCl^, 1.12 ^1 of 
0.125 mM dNTPs, 0.45 fA of the forward primer at a 0.2 

concentration, 0.45 /A of the reverse primer at a 0.2 /^M 
concentration, 0.13 of Taq polymerase (0.003 U//il), and 
2.3 ^1 of DNA sample at a 0.2 ng/j«l concentration, for a total 
volume of 22.3 ^uL The PCR reaction normally was carried 
out using one step at 95° C. for 10 minutes; 45 cycles at 95" 
C. for 30 seconds, 60* C. for 45 seconds, and 72** C. for 45 
seconds; one step at 72" C. for 5 minutes; and then finalizing 
the reaction at 22° C. 


Position 

Reference in SEQ 

SNP ID ID NO : 1 Forward PCR primer 


rs9 3018 7 7337 AACCCTCCCTCAAAAAAAGAC 

rBll82945 11030 GGAAAACAAGAACTCCAGAAACT 

rsI044251 15847 TGCCACCCTTGTGCTTGTA 

(rs25643) 

rs725135 17338 CTGAGGCAGTCGGATCATC 

rs25644 21708 GAGGTTTTCGTCGCTCTGAT 

rs 1151880 22713 TCACTCCAGTCTCTGCCTCTC 
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[Q242] After the PGR reaction was completed, an exten- 
sion ot^onucleotide was hybridized to the FCR product. 
Extension oligpnucleotides are reported in Table 3. 

TABLE 3 

Poei-bion in 
SEQ 

ID NOs 1 Sx-bension Ollgonucleo-bida SEQ ID NOs 
73 37 AAA2UU^AGACCGAACAC 
110 30 CATTAGCTCGGTCTCC 
158 47 GCTTTCAACGGGTC 
17338 TAGCTGGGATTACAGAC 
21708 CATGTCCTGGAAAC 
22713 GCCTTCTGCCTTGTA 


[0243] The extension oligonucleotide was complementary 
to the amplified target up to but not including the polymor- 
phism, and was extended enzymatically one or a few bases 
through the polymorphic site. In the extension phase of the 
assay, a single dNTP was added to the reaction, and pyro- 
phosphate was generated if the dNTP was added to the 
extension oligonucleotide. ATP sulfurylase present in the 
reaction mixture utilized the pyrophosphate in conjunction 
with APS to generate ATP. ATP drove the luciferase-cata- 
lyzed conversioo of luciferia to oxyluciferin, which gener- 


ated the release of visible light measured by a CCD camera. 
A graphic representation was generated showing a peak 
corresponding to the amount of light emitted, where the light 
was proportional to the amount of nucleotide incorporated 
into the extension oligonucleotide. dATP was not added to 


the reaction, and instead, was replaced by dATPyS, which 
was not turned over by luciferase. Apyrase was added to the 
reaction to degrade unicorporated dNTP and ATP sulfury- 
lase-generated ATP, and when the apyrase reaction was 
complete, another dNTP was optionally added to the reac- 
tion for another extension phase. 

[0244] An altemative assay involved a MassARRAV™ 

system (Sequenom, Inc.), which was utilized to perform 
SNP genotyping in a high -throughput fashion. This geno- 
typing platform was complemented by a homogeneous, 
single-tube assay method (hME'^** or homogeneous Mas- 
sEXTEND™ (Sequenom, Inc.)) in which two genotyping 
primers anneal to and amplify a genomic target surrounding 
a polymorphic site of interest. A third primer (the MassEX- 
TEND™ primer), which is complementary to the amplified 
target up to but not including the polymoiphism, then was 
enzymatically extended one or a few bases through the 
polymorphic site and then terminated. 

[0245] For each polymorphism, SpectroDESIGNER™ 
software (Sequenom, Inc.) was used to generate a set of PGR 
primers and a MassEXTEND™ primer was used to geno- 
type the polymorphism. Table 4 shows PGR primers and 
Table 5 shows extension primers used for analyzing poly- 
morphisms. The initial FCR amplification reaction was 
performed in a 5 |<1 total volume containing IxPCR buffer 
with 1.5 mM MgCl, (Qiagen), 50 |/M each of dATP, dGTP, 
dCTP, dTTP (Gibco-BRL), 2.5 ng of genomic DNA, 0.1 
units of HotStar DNA polymerase (Qiagen), and 200 nM 
each of forward and reverse PGR primers specific for the 
polymorphic region of interest. 

TABLE 4 


SEQ 
ID 
NO: 


[0246] Samples were incubated at 95® C. for 15 minutes, 
followed by 45 cycles of 95** C. for 20 seconds, 56* C. for 
30 seconds, and 72° C. for 1 minute, finishing with a 3 
minute final extension at 72° C. Following amplification, 
shrimp alkaline phosphatase (SAP) (0.3 units in a 2 /il 


Referencf 
SNP ID 


Position 
in SEQ 

ID NO:l Forward PGR primer 


SEQ 
ID 

NO: Reverse PCR primer 


rsl044251 
(£-825643) 


rell51880 

rs208307 

r8l621388 

rs26S6384 

rBll82945 

rsl653586 

rsl653589 

£-81063843 


7337 ACGTIGGATGACCCTCCCTCAAAAA 
AAGAC 

15847 ACGTTGGATGTTGTAGGAGTCTCAA 
CAGGC 

21708 ACGTTGGATGGGCAAAATAGTGGA 

22713 ACGTIGGATCTCCTGTCATGiWKSCC 
TTCTG 

np TCCTGGGAAAGAGACASATC 

np TGCCTGGCTTCAGTAAGGAC 

np TGGTTCCTGAGTTTAAAGCC 

11030 CTCAAGAGAAGGGTCTTGTG 

np TCCOGACCCTTTCTACTATG 

np TATATACAGGCGGGCATAGG 

np GAAACAGAAGTGACTGCCTG 


ACGTTGGATGCATGCCACTCA 
TTATCATGTG 

ACGTTGGATGACGTGTGTGTC 
ATCCTCCAC 

ACGTTGGATGAGTTGACCTGG 
ATGCCCATG 

ACGTTGGATGGGCCCTAAATT 
CAGTGACT6 

GTGATCTTTAAACCTGGCAG 
CGCTGGAGGATCCGGAAAGA 
AAAAACCAGCAGCTTTCTAC 
CCTCCTGACTTTTATGGCTC 
TGTGGAAAAACACGAAAGGC 
TTCACAGCTTCAGGCATGTG 
CACTGCCCTTCCCACTGAG 
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volume) (Amersham Pharmacia) was added to each reaction 
(total reaction volume was 7 /^l) to remove any residual 
dNTPs that were not consumed in the PGR step. Samples 
were incubated for 20 minutes at 37** C, followed by 5 
minutes at 85*" C. to denature the SAP. 

[0247] Once the SAP reaction was complete, a primer 
extension reaction was initiated by adding a polymorphism- 
specific MassEXTEND™ primer cocktail to each sample. 
Each MassEXTEND'^'*'' cocktail included a specific combi- 
nation of dideoxynucleotides (ddNTPs) and deoxynucle- 
otides (dNTPs) used to distinguish polymorphic alleles from 
one another, bi Table 5, dNTPs are shown and the fourth 
nucleotide not shown is the ddNTP (e.g., in the first row A, 
C and T are dNTPs and G is the ddNTP). 


[0250] SNP Verification 

[0251] Polymorphisms identified in the publicly available 
database were verified by detecting the presence or absence 
of each polymorphism across six individuals from Sweden 
(including PGR negative control and one sequence primer 
extension control). Where a polymorphism was present in 
two or more of the individuals, the polymorphism was 
designated as a statistically significant SNP and genotyped 
across the test population. Where the polymorphism was not 
identified in any of the six individuals, it was further 
examined in a population of thirty Gaucasian blood donors 
from Sweden. In this group of tihirty individuals, a poly- 
morphism having a frequency of 10% or greater was des- 
ignated as a statistically significant SNP and genotjrped 
across the test population. The probability of not identifsdng 


TABLE 5 

Reference SNP 
ID 

Pos±-blon In. 
ID nO:l 

SEQ 

Ext;end Probe SE 

T ermlna-tion 
]Q ID NO: Mix 

rs930187 

7337 

CCTCAAAAA2U^CCGAACAC 

A,C,T 

rBl044251 
(rs25643) 

15847 

GCCTAGCTTTCAACGGGTC 

A,C,6 

r-B25644 

21708 

ACOGCCATGTCCTGGAAAC 

A,C,G 

rsllSlBSO 

22713 


A,C,T 

rs208307 

ne 

TGGCAGGATGTTTCTCCTCT 

A,C,T 

E-B1621388 

n.B 

AC3GA.TCCGGAAAGAISTTTCC 

A,C 

rs2686384 

US 

CAGCTTTCTACTATATTCATGTAA 

A,C 

rsll82945 

ns 

GCTCATTAGCTCGGTCTCC 

C,T 

rsl653586 

ns 

GGAAAAACACGAAAGGCTTAGTT 

C,T 

rsl653589 

ns 

TCAGGCATGTGATCAGTTAGAGCC 

A,C,T 

rBl063B43 

n.6 

CCTTCCCACTGAGCACATC 

A,C,T 


[0248] The MassEXTEND™ reaction was performed in a 
total volume of 9 /ul, with the addition of Ix ThermoSeque- 
nase buffer, 0.576 units of ThermoSequenase (Amersham 
Pharmacia), 600 nM MassEXTEND^" primer, 2 mM of 
ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 
mM of dATP or dCTP or dGTP or dTTP. The dideoxy (dd) 
nucleotide used in the assay was complementary to the 
nucleotide at the polymorphic site in the ampUcon. Samples 
were incubated at 94° C. for 2 minutes, followed by 45 
cycles of 5 seconds at 94** C., 5 seconds at 52® C, and 5 
seconds at 72* C. 

[0249] Following incubation, samples were desalted by 
adding 16 /il of water (total reaction volume was 25 fil), 3 mg 
of SpectroCLEAN™ sample cleaning beads (Sequenom, 
Inc.) and allowed to incubate for 3 minutes with rotation. 
Samples were then robotically dispensed using a piezoelec- 
tric dispensing device (SpectroJET™ (Sequenom, Inc.)) 
onto either 96 -spot or 384-spot silicon chips containing a 
matrix that crystallized each sample (SpectroCHIP''"* 
(Sequenom, Inc.)). Subsequently, MALDI-TOF mass spec- 
trometry (Biflex and Autoflex MALDI-TOF mass spectrom- 
eters (Bruker Daltonics) can be used) and SpectroTYPER 
RT'^'^ software (Sequenom, Inc.) were used to analyze and 
interpret the SNP genotype for each sample. 


a minor allele variant represented in 10% or more of a 
population was calculated as being about 0.2% when 
samples from 30 individuals are analyzed, where it was 
estimated that 19% of individuals in the total population 
would be carriers for the minor allele assuming a large 
population and no selection pressure. 

[0252] The following polymorphisms reported in the 
dbSNP database were identified as being polymorphic (i.e., 
statistically significant) in the verification studies: rs208307, 
rsl063843, rsll82945, rsl621388, rsl653586, rsl653589, 
rs26863S4, rsl044251 (rs25643), rs25644, and rsllSlSSO. 
Polymorphisms reported in the dbSNP database as rs96064, 
rsl044249, rs2303998, and rs2288690 were identified as not 
being polymorphic, where the first three were tested in 
seventeen individuals and where the fourth was tested in 30 
individuals described in Example 3. Also, a SNP at position 
14744 in SEQ ID NO:l, which was not reported in the 
dbSNP database, was identified in sequencing/genotyping 
studies described in Example 3 using the primers referred to 
in Table 8 as "Exons 3&4" primers. 

[0253] Genotype Analysis 

[0254] Among the verified SNPs, Table 6 depicts three 
SNPs that were strongly associated with fat deposition. 
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Allele frequency is noted in the second column and the allele 
indicated in bold type is the allele associated with increased 
central fat deposition. Statistical significance of each asso- 
ciation was determined by the computer program QTDT 
(Abecasis et al., Amer. J Human Genetics, 66: 279-292 
(2000)). Transmission/disequilibrium tests (TDT), popula- 
tion stratification tests, and total association tests were 
utilized in this computer program. Because population strati- 
fication could have lead to spurious results fi-om total 
association testing, association testing was considered only 
when there was no evidence of population stratification. 

TABLE 6 


SNP Position in 


SEQ ID NO: 1 

Allele Frequency 

Statistical Significance 

11030 

A = 0.67 

p - 0.0168 


T = 0.33 


15847 

T = 0.55 

p - 0.0317 


C = 0.45 


1733S 

C - 0.76 

p - 0.0146 


T = 0.24 


14744 

C = 0.55 

p = 0.0000018 


T-0.45 



[0255] SNP 14744 was originally genotyped over 30 dizy- 
gotic and monozygotic twins fi-om the St. Thomas and Royal 
North Shore samples described in Example 1, and was found 
to be associated with fat deposition. Upon genotype analysis 
over the remaining St. Thomas and Royal North Shore 
samples (N=1047), the original association with fat deposi- 
tion was not found. The minor allele frequency (T of the T/C 
SNP) was 0.16 and the p-value was 0.183 (QTDT analysis). 

[0256] Haplotype analysis was performed using a program 
known as QPDT (Martin et al.,Amer. J. Hitman Genetics, 
67: 146-54 (2000)), which utilizes the EM algorithm 
(Dempster et al.,/. Royal Statistical Soc, 539: 1-38 (1977)). 
The program was utilized to assign haplotypes based on 
likelihood of maximization. Table 7 shows possible haplo- 
types for four SNPs in the P2X4 gene and estimated fre- 
quencies for each. 

TABLE 7 


Nucleotide Position in SEQ ID NO: 1 

Allele 11030 15847 21708 22713 Frequency 

HI T T A T 0.54949 

H2 A C A T 0.21584 


TABLE 7-continued 



Nucleotide Position 

in SEQ ID NO: 1 


Allele 

11030 15847 

21708 22713 

Frequency 

H3 

T C 

A T 

0.12152 

H4 

A C 

G C 

0.10964 


[0257] Haplotype versus single position association analy- 
sis for the F2X4 gene suggested that the HI haplotype and 
H3 haplotype were most significantly associated with lean- 
ness. 

Example 3 

Association of an Insertion in P2X4 with Fat 
Deposition 

[0258] The genetic analysis described in Example 2 iden- 
tified three SNPs associated with fat deposition. Further 
studies were conducted to identify new SNPs or other 
polymorphic variations associated with fat deposition in the 
first six exons of P2X4. Samples from 30 individuals of the 
study group were utilized in this study where 10 individuals 
were homozygous for the HI haplotype in Table 7, 10 
individuals were homozygous for the H2 haplotype, and 10 
individuals were homozygous for the H4 haplotype. The 
haplotypes were chosen for several reasons. First, choosing 
individuals from the same haplotype would increase the 
probability of identifying a SNP in Linkage disequilibrium 
with other SNPs in the haplotype associated with fat depo- 
sition. Second, individuals homozygous for the HI haplo- 
type were characterized as having significantly decreased 
central fat deposits compared to individuals homozygous for 
the H2 and H4 haplotypes. Therefore, the three haplotypes 
represent two distinct groups of individuals, and therefore 
were used to determine differences in fat deposition for 
newly discovered SNPs. Exons 1-6 in the P2X4 nucleic acid 
were sequenced in the following manner. 

[0259] A 50 fA PGR reaction was prepared to amplify a 
region to be sequenced. Typical reagent concentrations were 
Ix Qiagen PGR buffer, 1.5 mM MgGl^, 200 ,uM dNTPs, 200 
nM forward primer, 200 nM reverse primer, 1 Unit HotStar- 
Taq polymerase (Qiagen), and 25 ng genomic DNA. For- 
ward and reverse primers used for the analysis are shown in 
Table 8. Primers were utilized in sequencing and PGR 
reactions, except pairs marked with an asterisk (*), which 
were utilized only in sequencing reactions, and pairs marked 
with a double asterisk (* *), which were utilized only in PGR 
reactions. 


Forward Primer Reverse grimer 

Location Location 

Region in in SEQ ID SEQ ID in SEQ ID 

P2X4 NO:l Sequence IJO: NO:l Sequence 

Exon 1 3 361- GAGACGAGGTCTCGC 2 65 5- CCCAGGCTGTGG 

3344 TATGT 267 4 TTTTAA 

Exon 1* 2990- GGGACCCAGACCGAC 298 4- CAGTCCCCTAGT 

3008 TAG 3001 CGGTCTG 
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TABLE 8 -continued 


Forward Primer 


Reverse Primer 


Loca-tion 
Region in in SEQ ID 
P2X4 HOsl Sequence 


Loca-tion 
SEQ ID in SEQ ID 


Exon 2 10314— 
10333 


TCCCCTAGAAGGTAC 


9855- 
9874 


CTGCCTGQATAC 
ACTCTGTT 


Exons 3S4 15 107— 
15125 


GCGGTCAGTGTTTGA 
GTTG 


14675- 
14693 


TGTGAAGGAATT 
GGGAAGA 


16074- 
16092 


15636- 
15656 


GTGGTGGGTGCC 
TATAATC 


22042- 
22059 


■Not Used GCCTGAGTGCTG 
GTCTTC 


21705- 
21722 


AACATGGTTATGTGA 
GTATTTG 


21336- 
21357 


CATGTCCTGGAA 
ACTGTG 


CCTCAAGTCGTGCAT 


designed TTA.TG 


Pramoi;er 
Sequence 

Promoter 
Sequence 

Promoter 
Sequence 

Promoter* 
Sequence 

Promoter* 
Sequence 

Promoter* 
Sequence 


1275- 
1295 


1994- 
2014 


2772- 
2790 


918- 
936 


1642- 
1659 


2352- 
2370 


2900- 
2917 


CCAAAGTGCTGGGftT 

TAAA 


CCAGGAAATGAGCTA 

AAAATG 


C AC AT GC GGA2iAATT 
GTATTC 


GAGGATTTAGCAATG 
GAGTTT 


GCCTGTAATCCCAAC 
ACTT 


TACCCCTCACTTGAA 
CTCTG 


CGAGTTCCATGCOCC 
CTAGAG 


21623- 
21642 


464- 
482 


1210- 
1230 


1913- 
1938 


857- 
877 


2322- 
2341 


2820- 
2840 


None designed 


GTCTCTAACCGC 

TCTCATCTG 


CCACTGAGAAA 
ATGCTATCAC 


CGAAAGGCTCCT 
TCAAAAG 


TAICTGCCCAAA 
TGGAGTT 


CTCCTGACCTCC 
CATGAT 


GCGAAGAGAGA 
GTGACCTC 


CCCGGAAGGAG 
GCAGCAC 


27042— 
27061 


CCTGGGACC&liCTTG 
AGA 


26283— 
26300 


GAGCCGGJU^TA 
GTGGATTTA 


ExonB 9 to26003— 
11 26022 


TGTGCCAGCTCCACT 
CTAAC 


25200- 
25219 


GGCTCCCACACC 
CAAGAGAT 


Exone 9 to25662— 
11* 25680 

Exon 12* 2S744- 

26763 


TCATCCCCACTATGA 
TCAACAT 


CCAGGAGGTCAGCAG 
TCTGT 


25499- 
25520 


26606- 

26625 


GGCCAAGGGAA 
AGAAGGTA 


CCTGGAGAGAG 
CTGGAGACA 


Exon 1** Same as 
2820- 


CAGTAATACGACTCA 
CTATAGGGAGAAGGC 
2 840, butTCGAGTTCCATGOCC 
with 5' CCTAGAG 
SP6 

sequence 


Same as CGATTTAGGAGA 
2 82 0- C AC T ATAGAAGA 

2 840, butGCCCGGAAGGA 

with 5 • T7GGCAGCAC 

sequence 


[0260] 5 /Lil of each PGR product was analyzed on an 
agarose gel to confirm the size and purity of the amplified 
product. PGR products were purified using a Mo Bio Ultra 
Clean PGR kit. 45 /A of PGR product was mixed with 225 
fil of Spin Bind buffer. The mixture was loaded onto a filter 
column and centrifiiged for 30 seconds at 13,000 rpm. The 
solution was decanted. 300 jul of Spin Clean buffer was 
added to the filter column and centrifuged for 30 seconds at 
13,000 rpm. The solution was decanted and the column was 


centrifuged again at 13,000 rpm for one minute. The filter 
column was placed in a new tube and 30 //I of water was 
added to the center of the filter column. The column was 
centrifuged for one minute at 13,000 rpm to elute DNAfrom 
column. 

[0261] Sequencing reactions were performed with the 
following components: 2 jul of purified PGR product, 1 fil of 
sequencing primer at a 3.2 fiM. concentration, and 2 fil of 
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Applied Biosystems BigDye version 3.0 Ready Mix. The 
sequencing reaction was cycled with the following tempera- 
ture program: (a) 96° C. for 15 seconds; (b) 55° C. for 15 
seconds; (c) 60° C. for 4 minutes; (d) repeat steps (a) through 
(c) 24 times; and (d) 4° C. for completing the reaction. 

[0262] Excess dye tenmnatois were removed by precipi- 
tating the reactions. 25 //I of a sodium acetate-ethanol 
solution (1 ,m1 of 3M sodium acetate, pH 4.6 per 25 id of 
100% ethanol) was added to each sequencing reaction. 
Reactions were centrifuged at 4,000 rpm for 30 minutes and 
the sohition was decanted. 

[0263] ICQ jUl of 70% ethanol was added to each reaction 
and centrifuged at 4,000 rpm for 10 minutes and the solution 
was decanted. 4 fA of loading buffer was added to each 
sample, and samples were vortexed. Loading buffer was a 
mixture of 1 part Blue Dextran/EDTAto 5 parts form amide. 
Samples were heated to 95° C. for 5 minutes to denature, 
then cooled to 4° C. 1.7jm1 was loaded onto a 5% acrylamide, 
6M urea gel and run for 7 hours on an Applied Biosystems 
Model 377 sequencer. 

[0264] Prior to receiving nucleic acid samples for the 30 
individuals, 17 random samples were chosen for sequendog 
of P2X4 exons 1 to 6. A seven-base pair insertion located in 
exon 1 within positions 2878 to 2891 of SEQ ID NO:l was 
a newly identified polymorphism. In a majority of samples 
the sequence spanning positions 2878 to 2891 in SEQ ID 
NO:l had two GGGCCCC repeats such that the sequence 
was 5'-GGGCCCCGGGCCCC-3' (SEQ ID NO:). A GGGC- 
CCC insertion was identified in a minority of the samples, 
thereby yielding three GGGCCCC repeats in individuals 
having the insert instead of the two repeats shown in SEQ ID 
NO:l, which led to the sequence 5'-GGGCCCCGGGC- 
CCCGGGCCCC-3' (SEQ ID NO:) instead of 5'-GGGC- 
CCCGGGCCCC-3' (SEQ ID NO:). The frequency of the 
insertion was 2.9% in the 17 samples, which corresponds to 
one insertion out of 34 alleles. In the group of 30 samples, 
the insertion allele frequency was 32.8% (19 out of 58 
alleles, sequences were not interpreted for one individual). 

[0265] Within the 30 samples, eight individuals were 
homozygous for the insertion and three were heterozygous 
for the insertion. Central fat values for the homozygous 
insertion individual versus homozygous deletion individuals 
were compared to one another using the Student's T-test, and 
it was determined that individuals homozygous for the 
insertion had a significantly higher prevalence of central fat 
deposition (p=0.0064). Specifically, individuals homozy- 
gous for the insertion (N=9) possessed an average central fat 
value of 1993.42 grams (SEM=178.04), whereas individuals 
homozygous for the deletion (N=18) had an average central 
fat value of 892.51 grams (SEM=184.21). Thus, the percent 
increase in central fat deposition between the two groups 
was 53.66%, significantly showing the importance of the 
insert for diagnosing a predisposition to fat deposition. 

[0266] This result, however, did not replicate when the 
insertion/deletion polymorphism was genotyped in a larger 
population. The insertion/deletion polymorphism located at 
position 2878 was originally genotyped over 30 dizygotic 
and monozygotic twins from the St. Thomas and Royal 
North Shore samples described in Example 1, and was found 
to be associated with fat deposition as described above. 
Upon genotype analysis over the remaining St. Thomas and 
Royal North Shore samples (N=1011), the original associa- 


tion with fat deposition was not found. The minor allele 
frequency (Insertion of the Insertion/Deletion polymor- 
phism) was 0.09 and the p-value was 0.252 (QTDT analy- 
sis). 

Example 4 

NIDDM Sample Selection 
[0267] Pooling Strategies 

[0268] Samples were placed into one of four groups based 
on disease status. The four groups were female case samples, 
female control samples, male case samples, and male control 
samples. A select set of samples lirom each group were 
utilized to generate pools, and one pool was created for each 
group. Each individual sample in a pool was represented by 
an equal amount of genomic DNA. For example, where 25 
ng of genomic DNA was utilized in each PCR reaction and 
there were 200 individuals in each pool, each individual 
would provide 125 ^//g of genomic DNA. Inclusion or 
exclusion of samples for a pool was based upon the follow- 
ing criteria and detailed in the tables below. Selection 
criteria for the study described herein included patient 
ethnicity and diagnosis with NIDDM. Other phenotypic data 
collected included GAD antibody concentration, HbAlc 
concentration, body mass (BMI), patient age, date of pri- 
mary diagnosis, age of individual as of primary diagnosis 
(See Table 9A below). Cases with elevated GAD antibody 
titers and low age of diagnosis were excluded to increase the 
homogeneity of the diabetes sample in terms of imderlying 
pathogenesis. Controls with elevated HbAlc were excluded 
to remove any undiagnosed diabetics. Control samples were 
derived from non-diabetic individuals with no family history 
of NIDDM. Secondary phenotypes were also measured in 
the diabetic cases, phenotypes such as HDL, LDL, tr^yc- 
erides, insulin, C-peptide, nephropathy status, neuropathy 
status, to name a few, which will allow secondary analysis 
of the cases the be performed in order to elucidate the 
potential pathway of the disease gene. 


TABLE 9A 


No. of 

Actual no. 



individuals 

of samples 



fulfilling 

excluded 

No. of 


exclusion 

after eadi 

Eanqjles 

Exclusion Criteria 

criteria 

stage 

remaining 

ALL SAMPLES Ladt of avaUabfl- 

34 

34 

1591 

ity of sample 




ALL SAMPl^ Non-Geiman eth- 

261 

239 

1352 

nicity 




CASES GAD Ab >0.9 

102 

84 

1268 

CONTROLS HbAlc >6 or 

21 

20 

124S 

BMI >40 




CASES age <90 

17 

6 

1242 

CASES Age of Diagaosis <35, 

150 

203 

1039 

CONTROLS Family Histoiy of 

170 



Diabetes 




CONTROLS A^-matching to case 

43 

43 

996 






[0269] The selection process yielded the pools set forth in 
Table 9B, which were used in the studies described herein. 
It should be noted that the diabetic cases are borderline 

obese and the controls are slightly overweight on average, 
where the World Health Organization classifies a BMI less 
than 25 as lean, and a BMI greater than 30 as obese. 
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TABLE 9B 



Female 


Male 

Male 
cantiol 

Pool size 

244 

244 

254 

254 

(Number) 





Pool Criteria 

case 

control 

case 

control 

(ex: casB/controI) 





Mean Age 

52.49 

49.02 

49.78 

50.57 

(ex: years) 





BMI (kg/m^^) 

31.61 

25.23 

30.02 

2553 


Example 5 

Association of Polymorphic Variations with 
NIDDM 

[0270] Blood samples were taken from individuals in the 
study population described in Example 4. Genomic DNA 
was extracted from these blood samples using standard 
techniques (BACC2 DNA extraction kit (Nucleon Bio- 
sciences)) and subjected to analysis. Based upon the the 
coexistence of all of the foUowing or differing combinations 
of central fat, hypertension, glucose intolerance, dyslipi- 
demia (elevated triglycerides and low HDL cholesterol), and 
impaired insuUn stimulated glucose uptake ("insulin resis- 
tance^') in a common disorder referred to as syndrome X, it 
was postulated that polymorphic variants associated with the 
development of central obesity would also be associated 
with NIDDM. 

[0271] The SNP at position 15847 of SEQ ID NO:l was 
also allelotyped and genotyped in NIDDM and non-NIDDM 
patients from the pool described above (see Example 4), 
using the same assay described in Example 2, (i.e., same 
primers provided in Tables 4 and 5). 

[0272] Genotype Analysis 

[0273] Table lOA and lOB show the allelotyping and 
genotyping results, respectively, for males and females. 
Statistical significance of each association was determined 
by the computer program QTDT (Abecasis et al., Amer. J. 
Human Genetics, 56: 279-292 (2000)). Transmission/dis- 
equilibrium tests (TDT), population stratification tests, and 
total association tests were utilized in this computer pro- 
gram. Because population stratification could have lead to 
spurioiis results from total association testing, association 
testing was considered only when there was no evidence of 
population stratification. 


TABLE lOA 



Allelotvoinc Results 


SNP Position 




15S47 in 

Allele Frequency 

Allele Frequency 

StatistLcal 

SEQ ID NO: 1 

Cases 

Controls 

Significance 

Females 

C = 0.500 

C = 0.583 

p = 0.017 


T = 0.500 

T = 0.417 


Males 

C = 0.535 

C = 0.501 

p = 0.341 


T - 0.465 

T - 0.499 



[0274] 


TABLE lOB 


Oenotyping Results 


SNP Position 




15847 in 

Allele Frequency 

Allele Frequency 

Statistical 

SEQ ID NO: 1 

Cases 

Controls 

Significance 

Females 
Males 

C - 0.438 
T - 0.562 
C - 0.472 
T- 0^28 

C - 0.504 
T - 0.496 
C - 0.440 
T - 0.560 

p - 0.03S 
p - 0.311 


[0275] Both allelot3rping and genotjping analysis revealed 
that a thymine at position 15847 of SEQ ID NO: 1 is 
associated with NIDDM in females. Similarly, a thymine at 
the same position was also found to be associated with 
central obesity (sec Example 2). Therefore, the data dem- 
onstrates this SNP serves as a marker for an increased risk 
of developing obesity or diabetes either separately or 
together as part of a greater metabolic syndrome. 

Example 6 

Sequencing of P2X4, Identifyit^ Novel SNPs and 
Determining Linkage Disequilibrium (LD) 

[0276] The P2X4 gene was sequenced using ABl's Dye- 
Terminator Cycle Sequencing. The sequencing of the ampli- 
fied DNA was carried out on ABI 377 sequencers. The 
sequences of the amplification products were determined 
using automated dideoxy terminator sequencing reactions 
with a dye terminator cycle sequencing protocol. The 
sequence data were further evaluated to detect the presence 
of polymorphic variants within the amplified fragments. The 
polymorphism search was based on the presence of super- 
imposed peaks in the electrophoresis pattern resulting from 
different bases occurring at the same position. These poly- 
morphisms are described in Table 11 below. 

TABLE 11 


P2X4. 
cnnsensus 




Pcsition.in. 

seq. 



RelativB.to. 

SEQ ID 

location 

GeneJocatLon 

Individual.ID 

IS1044249 

NO: 1 

PLA2 

exonS 

rs5637 



P2RX7 

intron7 

rs20S307 

127S01008 


P2RX7 

exonll 

rsl718119 

127S12223 


P2RX7 

exonl3 

r8l621388 

1278 19 683 


P2RX7 

3' downstream 

r826S63S4 

127822211 


P2X4 

promoter 

P2RX4- 
Picmoterl075 

127841528 

1075 

P^4 

promater 

P2RX4- 
Piomoter2383 

127S42S36 

23S3 
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TABLE ll-continued 


TABLE 12-contmued 


P2X4. 
consensus. 


location Gene.location IndividuaUD 


Relative, to. 
rsl044249 


Pasitioii.iiL. 
SEQ ID 
NO: 1 


P2X4 

promoter 

P2RX4- 

127S42947 

2494 


Proniotei2494 



P2X4 

esaol 

InDel 

127S43331 

2878 

P2X4 

exoal 

rsl 044249 

127S43495 

3042 

P2X4 

intronl 

P2RX4-3191 

127S43644 

3191 

P2X4 

intrort2 

SQNM- 

127S51483 

11030 



rsl 182945 



P2X4 

Lntron2 

GEMINl- 

127S51483 

11030 



rsl 182945 



P2X4 

Lntroa2 

P2RX4- 14744 

127855197 

14744 

P2X4 


P2RX4- 14977 

127S55430 

14977 

P:^4 

iiitron4 

P2RX4- 15046 

127S55499 

15046 

P2X4 

mtron4 

P2RX4- 

127855512 

15059 



rsl 629287 



P2X4 


rsl 044251 

127856300 

15847 

P2X4 

uitroi]5 

i»725135 

127857791 

17338 

P2X4 

mtron6 

P2EQC4-21538 

127861991 

21538 

P2X4 

ezDii? 

P2RX4-21630 

127862083 

21630 




127862161 


P2X4 

intronS 

rsll51880 

127863166 

22713 

P2X4 

intronS 

P2RX4-25264 

127S65717 

25264 

P2X4 

exon9 

P2RX4-25346 

127865799 

25346 

P2X4 

intronlO 

F2KX4-25798 

127S66251 

25798 

P2X4 

LntxonlO 

P2RX4- 25864 

127866317 

25S64 

P2X4 

intronll 

P2RX4- 26327 

127866780 

26327 

P2X4 

intronll 

P2RX4- 26334 

127866787 

26334 

P2X4 

Lntronll 

P2RX4-26343 

127866796 

26343 

P2X4 

intionll 

P2RX4-26360 

127866813 

26360 

P2X4 

Econl2 

P2EiX4-26739 

127867192 

26739 

P2X4 

IhtroiilZ 

F2RX4-27043 

127867496 

27043 

P2X4/ 

3' downstream/ 

rsl 653586 

127871244 


CAMKK2 

3'-UTR 




P2X4/ 

3' downstream/ 

rsl 553589 

127S73206 


CAMKK2 

3'-LJTR 




P2X4/ 

3' downstream/ 

rsl063843 

127877354 


CAMKK2 

3'-LJTR 




CAMK1C2 

mtron 

rs25679S3 

127SS7358 


CAMKK2 


rsl 718120 



CAMKK2 

exoal 


i27yc ikia 



[0277] Some of the SNPs provided in Table were geno- 
typed in approximately 1100 individuals from the popula- 
tion described in Example 1. The genotype frequencies are 
provided in Table 12 below. 


TABLE 12 


Individual.ID 


Genotype Frequencies 


rs5637 
rB208307 


AA (28, 3%), AG (272, 26%), GG (760, 72%) 
CC (105, 10%), CG (423, 42%), GG (475, 47%) 


Individual.ID 


Genotype Frequencies 


IS1718119 
isl 621388 
rs2686384 

P2RX4-Promoterl075 

P2RX4-PromQtBr2383 

P2RX4-Promotei2494 

InDel 

IB1044249 

P2RX4-3191 

SQNM-rsllS2945 

GEMINI-rs 1182945 

P2RX4-14744 

P2RX4-14977 

P2RX4-15046 

P2RX4-rsl 629287 

IS1044251 

IS725135 

P2RX4-21538 

P2RX4-21630 

IS25644 

IS1151880 

P2RX4-25264 

P2RX4-25346 

P2RX4-25798 

P2RX4-25S64 

P2RX4-26327 

F2RX4-26334 

P2RX4-26343 

P2RX4-26360 

F2RX4-26739 

P2RX4-27043 

IB1653586 

IB1653589 

161063843 

IS25679S3 

IS1718120 

IS891780 


C (355, 35%), CT (514, 50%), T (157, 15%) 
C (338, 35%), T (146, 15%), TC (471, 49%) 
A (159, 15%), AG (518, 50%), G (354, 34%) 
A (27, 3%), G (788, 74%), G.A (252, 24%) 
A (21, 2%X A.C (233, 22%), C (797, 76%) 
A (415, 40%), G (136, 13%), G.A (494, 47%) 
DD (833, 83%), ID (157, 16%), H (11, 1%) 

G (736, 80%), G.T (137, 15%), T (49, 5%) 
A (461, 45%), AT (441, 43%), T (112, 11%) 
AA (115, 11%), AT (455, 43%), TT (479, 46%) 

C (742, 71%), CT (284, 27%), T (21, 2%) 
A (3, 0%), A.G (106, 10%), G (943, 90%) 
C (2, 0%), CT (102, 10%), T (936, 90%) 
A (117, 11%), G (466, 45%), G.A (453, 44%) 
C (200, 20%), T (317, 32%), T/C (472, 48%) 
A (657, 66%), A/G (215, 22%), G (117, 12%) 
C (826, 81%), T (13, 1%), T.C (184, 18%) 
A (790, 80%), A/G (187, 19%), G (12, 1%) 

C (12, 1%), CT (185, 19%), T (792, 80%) 
-(16, 2%), T (S39, 80%), T. -(189, 18%) 
A (1010, 96%), GA (37, 4%) 
C (17, 2%), G (7S4, 75%), GC (245, 23%) 
A (937, 90%), A.G (100, 10%), O (2, 0%) 

A (11, 1%), C (S18, 7S%), C.A (215, 21%) 
C (929, 89%), CT (114, 11%), T (1, 0%) 


C (464, 44%), CG (454, 44%), G (125, 12%) 
G (834, 91%), T (2, 0%), TG (76, 8%) 
C (903, 90%), T (3, 0%), TC (96, 10%) 
C (623, 65%), T (35, 4%), TC (307, 32%) 

G (271, 26%), GT (504, 48%), T (270, 26%) 


[0278] The extent of linkage disequilibrium (LD) between 
each pair of SNPs provided above was estimated as the 
difference between the observed two locus haplotype fre- 
quency using the major alleles at each SNF and the product 
of the observed major allele frequencies. The disequilibrium 
between SNPs was expressed by a common standardized 
metric, D' (D/min(piq2, p2qi)), where p, and q, were the 
minor allele frequencies at two SNPs, and p2 and q2 were the 
corresponding major allele frequencies. Significant devia- 
tion of this disequilibrium from zero was tested by the use 
of a chi-square goodness-of-fit test. The results of the LD 
analysis are provided in Table 13 and in FIG. 6. 


TABLE 13 


Haplotype 


Freq Rel.Freq Cum.Rel.Freq 


GCUAGCADGATCGTGCACATTACAACCGCCG 


128 0:058 0:058 

119 0:054 0:112 

GGCCGGCADGATCGTGTACATTAGA.CCCC3CTT 115 0:053 0:165 

GGCCGGCADTATCGTGTACATTAGACCGSCCG S4 0:043 0:20B 

GGCCGGCADGATCGTGTACATTAGACCCCSCTG 82 0:037 0:245 
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TABLE 13-con-bl nued 


Haplotype Freq Rel.Freq Cum.Rel.Freq 

6 GGCCGGAGIGTATGTACGTGC-AGACCGGCCT 72 0:033 0:278 

7 GGCC6GCADGATCGTGTACATTAGACCGGCCG 50 0:023 0:301 
B GGCCGGCADGATCGTGTACATTAGACTGGCCG 45 0l021 Ol322 
9 GCTTAGCADGATCGTGTACATTAGACCGGCCT 44 0:02 0 0:342 

1 0 GGGCGGCA]:GIATGTGCGTGC-AGA.CCCGCCT 43 0:020 0:362 

11 GCTTAGCGDGTATACACGCAUAGGCCCTTCG 38 0:017 0:379 

12 GCTTAGCADGATCGTGTACATTAG&CCCGCTT 37 0:017 0:396 

13 GGTTAACGDGTACGTACACATTAGACCCGCCT 34 0:016 0:412 

1 4 GCCCGGCADGATCGTGTACATTAGACCGGCCT 29 0:013 0:425 

15 AGCCGGCADGATCGTGTACATTAGACCGGCCT 27 0:012 0:437 

16 ACCCGGCADGATCGTGTACATTAGACCCGCTT 27 0:012 0:449 

17 GGCCGGCADGATCGTGTACATTAGACTGGCCT 26 0:012 0:461 

1 8 AGCCGGCADTATCGTGTACATTAGACCGGCCG 24 0:011 0:472 

1 9 GCTTAACGDGTACGTACACATTAGACCCGCCT 23 0:011 0:483 

20 GGTTAGCGDGIATACACGCATTAGGCCCTTCG 21 0:010 0:493 

21 GGCCGGCADGATCQTGTAiCATTAGAiCCOGCCT 18 0:008 0:501 

22 GCTTAGCGDGATCGTGCACATTACAACOGCCG 18 0:008 0:509 

23 GGCCGGCADTATCGTGTACATTAG&CCOGCTT 17 0:008 0:517 

24 AGCCGGCADGATCGTGTACATTAGACCCGCTT 17 0:008 0:525 

25 ACTTAGCADGATCGTGCACATTACAACCGCCG 17 0:008 0:533 
2 6 GGTTAACGDGTACGTACGCATTAGACCCGCCG 16 0:007 0:540 

2 7 GGTTAGCADGATCGTGTACATTAGACCGGCCT 15 0:007 0:547 

28 GGTTAACGDGTACGTACACATTAGACCCGCCG 15 0:007 0:554 

29 GGCCGGCADTATCGTGTACATTAGACCGGCCT 15 0:007 0:561 

30 GGTTAACGDSTACGTACGCATTAGACCGGCCT 14 0:006 0:567 

31 GGCCGGAGDT&TCGTATACATTAGACGGGCCG 14 0:006 0:573 

32 GGCCGACCH3GTACGTACACATTAGACCCGCCG 14 0:006 0:579 

33 ACTTAGCADGATCGTGTACATTAGACCGGCCT 14 0:006 0:S85 

3 4 GCTTAGAGDGTAGGTACGCATTACACCCGGCG 13 0:005 0:591 

35 GCTTAGAGDGTACGTACACATTACACCCGCCG 13 0:006 0:S97 

36 AGTTAACGDGTACGTACGCATTAGACCCGCCT 13 0:006 0:603 

37 GGCCGGAGIGTATGTACGTGG-AGACCCGCCG 12 OlOOS 0:608 
3 8 GGCCGACGDGTACGTACACATTGGAGCCGCCG 12 0:005 0:613 

3 9 GGCCGACGDGATCGTATACATTAGACCGGCCT 12 0:005 0:618 

4 0 GGTTAGCADGATCGTGCACATTACAAGCGCCG 11 0:005 0:623 

41 GGGCGACGDGATCGTATACATTAG&CCCGCTT 11 0:005 0:628 

42 GCTTAGCADGTATACGCGCATTAGGCCCTTCG 11 0:005 0:633 


us 2004/0018533 Al Jan. 29, 2004 

40 


TABLE 13-con-b± nued 


Haplotype Freq Rel.Freq Cum.Rel . Freq 

4 3 GCTTAGCADGATCGTGCACATTACAACCGCCT 11 0:005 0:638 

44 AGTTAGCADGATCGTGTACATTAGACCGGCCT 11 0:005 0:643 

45 AGCCGGCADTATCGTGTACATTAGACCQGCCT 11 0l005 0l648 
4 6 AGCCGGCADGATCGTGTACATTAGACCCGCTG 11 0:005 0:653 
4 7 GGCCGGAGDGATCGTATACATTAGACCGGCSG 10 0:005 0:658 
4 8 GGCCGACGDGATCGTATACATTAGACCGGCCG 10 0:005 0:663 

4 9 GCTTAACGDGTACGTACGCATTAG&CCCGCCT 10 0:005 0:668 

5 0 GCCCGGCADGATCGTGTACATTAGACCCGCTT 10 0:005 0:673 

5 1 GCTTAACGDGATCGTATACATTAGACCCGCTT 9 0:004 0:677 

52 AGCCGGAGIOIATGTACGTGC-AG&CCCGCCT 9 0:004 0:681 

53 ACTTAGCADGATCGTGTACATTAGACCCGCTT 9 0:004 0:685 
5 4 ACTTAGAGDGATCGTACACATTACAACCGCCG 9 0:004 0:689 
55 GGCCGGCADGATCGTGTACATTAGACCCGCCG 8 0:004 0:693 

5 6 GGCCGACGIGTATGTACGTGC-AGACCCGCCT 8 0:004 0:697 

57 GGCCGACGDTATCGTATACATTAG&CCGGCGG B 0:004 0:701 

58 AlGTIAGCADGTACGTGCGCATTAGACCOGCCT 8 0:004 0:705 

59 AGTTAACGDGTACGTACACATTAGACCOGCCG 8 0:004 0:709 

60 ACTTASAGDGTACGTACGGATTACACCOGCCG 8 0:004 0:713 

61 GGTTAGCAZGTATGTGCGTGC-AGACCCGCCG 7 0:003 0:716 

62 GGCCGGCCTIGATCGTATACATTAGACCGGCCT 7 0:003 0:719 

6 3 GGCCGGCADGTACGTGCACATTAGACCCGCTG 7 0:003 0:722 

6 4 GGCCGGAGIGTACGTACGCGCTAGACCCGCCT 7 0:003 0:725 

65 GGCCGGAGDGATCGTATACATTAGACCGGCCT 7 0:003 0:728 

66 GGCCGACGDGTACGTACGCATTAGACCCGCCG 7 0:003 0:731 

67 GCTTAGCADGATCGTGTACATTAGACGGGCCG 7 0:003 0:734 

68 GCTTAACGDGATCGTACACATTACAACCGCCG 7 0:003 0:737 

69 AGTTAGCADGTACGTGCACATTAGACCCGCCG 7 0:003 0:740 

70 ACCCGGCADGATCGTGTACATTAGACCGGGCT 7 0:003 0:743 

7 1 GGTTAGCADGTATACGCGCATTAGGCCCTTCG 6 0:003 0:746 

72 GGTTAGCADGTACGTGCGCATTAGACCCGCCT 6 0:003 0:749 

73 GGTTAGCADGTACGTGCACATTAGACCCGCCG 6 0:003 0:752 

74 GGTTAGCADGATCGTGTACATTAGACCCGCTT 6 0iO03 0:755 

75 GGTTAGCADGATCGTGTACATTAGACCCGCCT 6 0:003 0:758 

7 6 GGTTAGAGIGTATGTACGTGC-AGACCCGCCG 6 0:003 0:761 
7 7 GGTTAGAGDGTACGTACACATTAGACCCGCCT 6 0:003 0:764 

78 GGCCGGCADOIACGTGCACATTGGACCCGCCG 6 0:003 0:767 

79 GGCCGGAGDGATCGTATACATTAGACCCGCTT 6 0:003 0:770 
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TABLE 13-con-b± nued 


Haplotype Freq Rel.Freq Cum.Rel.Freq 

80 GCTTAGCADGATCGTGTACATTAGAGTGGCCG 6 0:003 0:773 

81 GCTTAGAGDGATCGTATACATTAGACCCG6TT 6 0:003 0:776 

82 GCCCGGCADGATCGTGTACATTAGACTGGCCT 6 0lO03 0l779 

83 GCCC6GCADGATCGTGTACATTAGACCCGCTG 6 0:003 0:782 

84 GGCCSGCADTATCGTGTACATTAGACCCGCTG 5 0:002 0:784 

85 GGCCGACGDGTACGTACGCATTGGACCCGCCG 5 0:002 0:786 

86 GCTTAGCGDGTACGTACGCATTAG&CCCTTCG 5 0:002 0:788 

8 7 GCTTAGCADGTACGTGCGCATTAGACCCGCCG 5 0:002 0:790 

88 GCTTAAAGDGTACGTACGCATTAGACCCGCCG 5 0:002 0:792 

89 GCTTAAAGDOIACGTACAGATTAGACCCGCSG 5 0:002 0:794 

90 GGCCGGCCHDGATCGTGTACATTAGACCQGCCT 5 0:002 0:796 

91 GCCCGGCADGATCGTGTACATTAGAGCGGCCG 5 0:002 0:798 

9 2 ACCCGGCADGATCGTGTACATTAGACCGGCCG 5 0:002 C:800 

93 GGTTAGCGDGTACGTACGCATTAGACCGGCCT 4 0:002 0:802 

94 GGTTAGCADGIACGTGTACATTAGACCCGCGT 4 0:002 0:804 

95 GGTIAGCADGTACGTACGCATTAGACCOGCCT 4 0:002 0:806 

96 GGTTAGAGDGTACGTGCACATTAGACCOGCCT 4 0:002 0:808 

97 GGTTAACGDTTACGTACACATTAG&CCOGCCG 4 0:002 0:810 

98 GGCCGGCGDGTATACACGCATTAGGCCCTTCG 4 0:002 0:812 

99 GGCCGGCGDGATCGTGTACATTAGACCCGCTT 4 0:002 0:814 
10 0 GGCCGGCGDGATCGTGTACATTAGACCCGCCG 4 0:002 0:81S 

101 GGCCGGCAI GTATGTGGG-TGC- AGACCCGCCG 4 0:002 0:818 

102 GGCCGGCAI GATCGTGTACATTAGACCGGCCG 4 0:002 0:820 

103 GGCCGGCADGTACGTACACATT AGACCCGCCG 4 0:002 0:822 

104 GGCC6GCADGATTGTGTATAT-AGACCCGCGG 4 0:002 0:824 

105 GGCCGGCADGATCGTGC&CATTACAACCGCCG 4 0:002 0:826 

106 GGCCGGAGITTACGTACGCGCT AGACCCGCCG 4 0:002 0:828 

107 GGCCGGAGDGTATGTACGTGC-AGACCCGCCT 4 0:002 0:830 

108 GCTTAGCADGTACGTGCGCATTAGACCCGCCT 4 0:002 0:832 

109 GCTTAGCADGATCGTGTACATTAGACCCGCCT 4 0:002 0:834 

110 GCTTAGAGDGTACGTACACATTAGACCCGCCT 4 0:002 0:836 

111 GCTTAACGDGATCGTATACATTAGACCGGCCT 4 0i002 0:838 

112 GCCCGGAGDGATCGTATACATTAGACCGGCCG 4 0:002 0:840 

113 AGTTAGCADGTACGTGCACATTAGACCCGCTT 4 0:002 0:842 

114 AGTTAGCADGATCGTGTACATTAGACCCGCCT 4 0:002 0:844 

115 AGTTAACGDGiTACGTACGCATTAGACCCGCCG 4 0:002 0:846 

116 AGCCGGCADGATCGTGTACATTAGACCCGCCG 4 0:002 0:848 
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Haplotype Freq Rel.Freq Cum.Rel . Freq 

117 AGCCGGAGDGATCGTATACATTAGACTGGCCG 4 0:002 0:85 0 

118 GGTTAGCGDGTACGTACACATTAGACCCGCCT 3 0:001 0:851 

119 GGTTAGCADSTACGTGCACATTAGACTCGCCG 3 OlOOl Ol852 

120 GGTTAGCADGATCGTGCACATTAGACCCGCCG 3 0:001 0:853 

121 GGTTAGAGIGIATSTACGTGC-AGACCCGCCT 3 0:001 0:854 

122 GGTTAGAGDGTACGTACGCAITAGACCCGCCT 3 0:001 0:855 

123 GGTTAACGDGTATACACACATTAGGCCCGCCT 3 0:001 0:856 

124 GGCCGGCGDGTACGTGCGCATTACACCCGCCG 3 0:001 0:857 

125 GGCCGGCGDGATCGTGTACATTAGACCGGCCT 3 0:001 0:858 

126 GGCCGGCGDGATCGTATACATTAGACCGGCCG 3 0:001 0:859 

127 GGCCGGCAIGATTGTATATGC-AGACCCGCTG 3 0:001 0:860 

128 GGGCGGCADTATCGTGTACATTAGACCGGCTG 3 0:001 0:861 

129 GGCCGGCADGTATGTGCGTGC-AGACCCGCCT 3 0:001 0:862 

130 GGCCGGCADGATCGTGTACATTGGACCCGCTG 3 0:001 0:863 

131 GGCCGGCADGATCGTGTACATTACAACCGCCG 3 0:001 0:864 

132 GGCCSGCADGATCGTGCACATTAGACCCGCTT 3 0:001 0:865 

133 GGCCGGAGITTATGTACGTGC-AGACCOGCCT 3 0:001 0:866 

134 GGCCGSAGIGTACGTACGCGGTACAACOGCCT 3 0:001 0:867 

135 GGCCGGAGIGTACATATGCGCTAGACCOGCCT 3 0:001 0:868 

136 GGCCGGACTIGATCGTATACATT7UGACTGGCCG 3 0:001 0:869 
13 7 GCTTAGCGDGTACGTACGCATTAGACCGTTCG 3 0:001 0:870 

138 GCTTAGCGDGATCGTGTACATTAGACCGGCCG 3 0:001 0:871 

139 GCTTAGCGDGATCGTACACATTACaaCOGCCG 3 0:001 0:872 

140 GCTTAGCADTTACGTGCACATTAGACCCGCCG 3 0:001 0:873 

141 GCTTAGCADTATCGTGCACATTAC&ACCGCCG 3 OsOOl 0:874 

142 GCITEAGCADGTACGTGCACATTACACCCGCCG 3 0:001 0:875 

143 GCTTAGCADGATCGTGCACATTAGACTGGCCG 3 OiOOl 0:876 

144 GCTTAAGGDTATCGTGTACATTAGACCGGCCT 3 0:001 0:877 

145 GGTTAACGDGTATACACGCATTAGGCCCTTCG 3 0:001 0:878 

146 GCCCGGCADTATCGTATACATTAGACCGGCCT 3 0:001 0:879 

147 AGTTAGCGDTATCGTATACATTAGACCCGCCT 3 0:001 0:880 

148 AGTTAGCAZGTATGTGCGTGC-AGACCCGCCT 3 OlOOl 0:881 

149 AGTTAGCADGATCGTGGAGATTACAACGGCCG 3 0:001 0:882 
15 0 AGTCAGCGDGATCGTATACATTAGACCCGCCT 3 0:001 0:883 

151 AGTCAACGDGTATACACGCATTAGGCCCTTCG 3 0:001 0:884 

152 AGCCGGCGDGiTACGTGCGCATTAGAACCGCCG 3 0:001 0:885 

153 AGCCGGCGDGATCGTATACATTAGACTOGCCG 3 0:001 0:886 
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Haplotype Freq Rel.Freq Cum.Rel . Freq 

154 AGCCGACGDGTACGTACACATTGGACCCGCCG 3 0:001 0:887 

155 ACTTAACGDGTACGTACACATTAGACCCGCCG 3 0:001 0:888 

156 GGTTAGCmGATCGTATACATTAGACTGGCCT 2 OlOOl 0l889 

157 GGTTAGCADGATCGTGCGCATTACAACCGCCG 2 0:001 0:890 

158 GGTTAGAGDGIACGTACACATTAGACCCGCCG 2 0:001 0:891 

159 GGTTAACGDGTATGTACATAT-ACAACCGCCT 2 0:001 0:892 

160 GGTTAACGDGTACGTACGCATTAG&CCCGCTT 2 0:001 0:893 

161 GGTTAACGDGAACGTACACATTACAACCGCCT 2 0:001 0:894 

162 GGCTGGCADGATCGTGTACATTAGACCGGCCG 2 0:001 0:895 

163 GGCCGGCGIOIATGTACGTGC-AGACTGGCCG 2 0:001 0:896 

164 GGCCGGCCHSTATCGTGTACATTAGACCGGCCG 2 0:001 0:897 

165 GGCCGGCGDGTACGTACGCATTAGACCCGCTG 2 0:001 0:898 

166 GGCCGGCGDGATCGTGTA.CATTAGACTGGCCG 2 0:001 0:899 

167 GGCCGGCGDGATCGTGTACATTAGACCGGCCG 2 0:001 0:900 

168 GGCCGGCADTTACGTGCACATTGGACCCGCCT 2 0:001 0:901 

169 GGCCSGCADTATCGTGTOCATTAGACTOGCCS 2 0:001 0:902 

170 GGCCGGCADTATCGTGTGCATTAGACCGGCCG 2 0:001 0:903 

171 GGCCGGCADTATCGTGCACATTAG&CCGGCCG 2 0:001 0:904 

172 GGCCGGCADTATCGTGCACATTACAACCGCCT 2 0:001 0:905 

173 GGCCGGCADGTACGTGCGCATTGGACCCGCCG 2 0:001 0:906 

17 4 GGCCGGCADGTACGTGCGCATTAGACCCGCCG 2 0:001 0:907 

175 GGCCGGCADGTACGTACGCATTAGACCCGCCG 2 0:001 0:908 

176 GGCCGGCADGATTACGTACATTAGGCCOGCCT 2 0:001 0:909 

177 GGCCGGCADGATTACGCACGTTAGGCCGTTCG 2 0:001 0:910 

178 GGCCGGCADGATCGTGTACATTAGACCGGCTT 2 0:001 0:911 

179 GGCCGGAGIQTTTGTACGTGC-AGACCCGCCT 2 0:001 0:912 

180 GGCCGGAGIGTACGTACGCGCTAGACCGGCCT 2 OiOOl 0:913 

181 GGCCGGAGDGTATGTACGTGC-AG&CCCGCCG 2 0:001 0:914 

182 GGCCGGAGDGTATGTACATGC- AGACCCGCCT 2 0:001 0:915 

183 GGCCGGAGDGATCGTATACATTAGACCCGCTG 2 0:001 0:916 

184 GGCCGGAADGAACGTGTACATTAGACTGGCCT 2 0:001 0:917 

185 GGCCGACGITTATGTATGTGT- AGACCCGCCT 2 OlOOl 0:918 

186 GGCCGACGDGTACGTACACATTAGACTGGCCG 2 0:001 0:919 

18 7 GGCCGACADTATCGTACGCATTAGACCCGCCG 2 0:001 0:920 

188 GCTTGGCAIGATCGTGTACATTAGACCCGCCG 2 0:001 0:921 

189 GCTTAGCGDGiTACGTACAGATTACACCCGCCG 2 0:001 0:922 

190 GCTTAGCGDGATCGTGCACATTACACCCGCCG 2 0:001 0:923 
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Haplotype 

191 GCTTAGCAIGATTGTGCGTGC-ACAACCGCTT 

192 GCTTAGCADGTACGTGCACATTAGACCCGCCT 

193 GCTTAGCADGATTGTGCATAT-AGACCCGCCG 

194 GCTTAGCADGATCGTGTACATTAGACCGGCTT 

195 GCTTAGAGDGIACGTACGCATTAGACCCGCCG 

196 GCTTAGAGDGTACGTACACATTAGACCCGCCG 

197 GCTTAGAGDGATCGTATACATTAGACCGGCCT 

198 GCWVAGAADT ATCGTACAGAT T ACAAC CGC C T 

199 GCCCGGCGDGTTTACACGCATTAGGCCCTTCT 

200 GCCCGGGGDGATCGTATACATTAGACCCGCTG 

201 GGCCGGCADTATVGTGTATAT- AGACCCGCCT 

202 GGCCGGAGIGTATGTACGTGC-AGACCCGCCG 
20 3 GCCCGGAGIGTACGTACGCGCTAGACCGGCCT 

204 GCCCGGAGDGATTGTGCGTGC-ACAACCGCCT 

205 GCGCAGCADGATCGTGTACATTAG&CCGGCCG 

206 AlGTIAGCADTATCGTGTAGATTAGACCOGCCS 

207 AGTTAGAGDGTACGTACGCATTAGACCOGCCT 

208 AGTTAACGIGTACGTACACATTAG&CCOGCCG 

209 AGCCGGCGDTTATGTATATAT-AGACCCGCCG 

210 AGCCGGCADTATCGTGTGCATTAGACCGGCCT 

211 AGCCGGCADGATTGTGCATAT-AGACCCGCCG 

212 AGCCGGCADGATCGTGTACATTAGACTGGCCG 

213 AGCCGGCADGATCGTATACATTAGACTGGCCG 

214 AGCCGACGOGTACGTAGGCATTAGACCCGCTG 

215 AGCCGACGDSTACGTACACATTAGACCCGCCG 

216 AGTTAGCADT&TCGTGTACATTAGACCCGCTT 

217 ACTTAAGCH3GAATACATOCATTAGGCGCTTCT 

218 GGTTAGCGDGATCGTGCACATTACAACCGCCG 

219 GGTTAGGGDGATCGTATACATTAGACCGGCCT 

220 GGTTAGCGDGATCGTATACATTAGACCCGCCT 

221 GGTTAGCAITTATGTGCGTGC-AGACCCGCCG 

222 GGTTAGCADTATCGTGTACATTAGACCGGCCT 

223 GGTTAGCADGATTGTGCACAT-AGAACCGCCG 
22 4 GGTTAGCADGATCGTGTACATTAGACCCGCCG 

225 GGTTAGCADGATCGTGCACATTACAACCGCCT 

226 GGTTAGAGIGAATGTACGTGC-AGACCCGCCT 

227 GGTTAGAGDGrTATACACGCATTAGGCCCTTCG 


Freq Rel.Freq Cum.Rel . Freq 


0:001 
0:001 
QiOOl 
0:001 
0 :001 
0:001 
0:001 
0 :001 
0 :001 
0:001 
0:001 
0:001 
0 :001 
0 :001 
0:001 
0:001 
0:001 
0:001 
0:001 
0:001 
0 :001 
0:001 
0:001 
0:001 
0:001 
0:001 
OiOOl 
0:000 
0 :000 
0:000 
0:000 
OiOOO 
0:000 
0 :000 
0:000 
0:000 
0:000 


0:924 
0:925 
0i926 
0:927 
0 :928 
0:929 
0:930 
0:931 
0:932 
0:933 
0:934 
0:935 
0:936 
0 :937 
0:938 
0:939 
0:940 
0:941 
0:942 
0:943 
0:944 
0 :945 
0:946 
0:947 
0:948 
0:949 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 :950 
0:950 
0:950 
0:950 
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Haplotype 


Freq Rel.Freq Cum.Rel.Freq 


228 GGTTAGAGDGTACATACGCATTAGACCCGCCT 

229 GGTTAACGDTTACGTACACATTAGACCGGCCG 

230 GGTTAACmGTATACACGCATTAGGCCCGGCT 

231 GGTC6GCGDGATCGTGTACATTAGACCGGCCT 

232 GGCTGGCJU: GIATGTGCGTGC- AGACCCGCST 

233 GGCTGACGDGTTTGTATGCATTAGACCCGCTT 

234 GGCTGACGDGTTTGTATACATTAG&CCCGCTT 

235 GGGCGGCGDTATCGTGCACATTAGACCGGCCG 

236 GGCCGGCGDGTTGGTATACATTAGACCGGCCT 

237 GGCGGGCGDGATCGTGTACATTAGACCCGCTG 

238 GGCCGGCCHDGATCGTGTACATTAGACCCGCCT 

239 GGCCGGCGDGATCGTATACATTAGACCCGCCG 
2 4 0 GGCGGGCGDGATCGTACACATTAGACCGGCCT 

241 GGCCGGCAIGATTGTGTATGC-AGACCCGCTG 

242 GGCCGGCADTATCGTGTACATTAGACTGGCCG 

243 GGCCSGCADGTACGTGCACATTGGACCOGCTS 

244 GGCCGGCADGTACGTGCACATTAGACCOGCCG 

245 GGCGGGCADGATTGCGTATGC-AG&CCOGTTG 

246 GGCCGGCADGATCGTGTACATTACACCOGCCT 

247 GGCCGGCADGATCGTGTACATT7U3ACCCGCTG 
24 8 GGCCGGCADGATCGTGTACAT-AGGCCGGCCT 

249 GGCCGGCADGATCGTATACAT-AGACGGGCTT 

250 GGCCGGCADGAACGTGTACATTAGACCOGCTG 

251 GGCCGGAGDGTTCATATAGATTAGACCCTCGT 

252 GGCCGGAGDSTACATAGACATTAGAGCCGCCT 

253 GGCCGGAGDG&TCGTGTACATTAGAGCCGCTT 

254 GGCCGACCH3TTACGTACACATTGGAGCCGCCG 

255 GGCCGACGDGTACGTACGGATTAGACCGGCGG 

256 GGCCGACGDGATCGTATACATTAGACCCGTGT 

257 GGCCGACGDGATCGTACACATTACAACCGCCG 

258 GCTTGGCADGATCGTGTACATTAGACCCGCCG 

259 GCTTGGAGDGATCGTATACATTAGACCCGCTT 

260 GCTTAGCGDTTATAGACGCATTAGGCCCTTCG 
2 61 GCTTAGCGDGTATATACGCATTAGGCCCTTCG 

262 GCTTAGCGDGTATACGCGCATTAGGCCCTTCG 

263 GCTTAGCGDGiTACGTACGCATTAGACCCGCCG 

264 GCTTAGCGDGATCGTGCACATTAGACGCXSCCG 


0:000 
0:000 
QiOOO 
0:000 
0 :000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 
0:000 
0:000 
0:000 
0 :000 
0:000 
0:000 
0:000 
0:000 
0:000 
OiOOO 
0:000 
0 :Q00 
0:000 
0:000 
OiOOO 
0:000 
0 :000 
0:000 
0:000 
0:000 


0:950 
0:950 
Oi950 
0:950 
0 :950 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0 :95 0 
0:950 
0:950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0 :950 
0:950 
0:950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 :950 
0:950 
0:950 
0:950 
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Haplotype 

265 GCTTAGCGDGATCGTGCACATTACAACCGTCG 

266 GCTTAGCGDGATCGTATACATTACAACCGCCG 

267 GCTTAGCAIGATTGTACGTGC-ACAACCGCCT 

268 GCTTAGCADTATCGTGTACATTAGACCGGCC6 

269 GCTTAGCADGIATACGTGCATTAGGCCCrrTCG 

270 GCTTAGCADGTACGTGCACATTAGACCCGCCG 

271 GCTTAGCADGATCGTGTACATTAG&CCCGCTG 

272 GCTTAGCADGATCGTGCGCATTACAACCGCCG 

273 GCTTAGCADGATCGTACACATTACAACCGCCG 

274 GCTTAGCADGAACGTGCACATTACAACCGCCG 

275 GCTTAGACHDGTACGTATACATTAGACCCGCCT 

276 GCTTAGAGDGTACGTACACGTTACACCCGCCG 
2 7 7 GCTTAGAGDGTACGTACACATTAG-ACCGGCCT 

278 GCTTAGAGDGTACATACGCATTAGACCCGCCT 

279 GCTTAGAGDGATTACACACATTAGGCCGGCCG 

281 GCTTAGAAIGATTGTACGTGC-ACACCOGCCT 

282 GCTTASAADGATCGTGTACATTAG&CCGGCCT 

283 GCTTAACGDTTACGTACACATTAGACCCGCCT 

284 GCTTAACCTIGATCGTATACATT7U3ACCCGCTG 
28 5 GCTTAACGDGATCGTATACATTAGACCCGCCG 

286 GC 1 TAACGDGATGGTAGACATTAGACCCGCCG 

287 GCTCAGCGDGATCGTGTACATTAGACCGGCCG 

288 GCCTAGCGDGTATACGCGCATTAGGCCCTTTG 

289 GCCTAGCADGATCGTGCACATTAGAACCGCCG 

290 GCCCGGCGDTTATACaCGCATTAGGCCCrTCG 

291 GCCCGGCGOGTATACACGCATTAGGCCCTTCG 

292 GCCCGGCADGTACGTACACATTAGACCCGCCG 
2 9 3 GGCCGGCADGATTACGTACATTAGGCCCGCCT 

294 GCCCGGAGIGATTGTACGTGC-ACACCCGCCT 

295 GCCCGGAGDGATTGTGCATGC-ACAACCGCCT 

296 GCCCGACGDGTAGGTACACATTGGACCCGCCG 

297 GCCCGACGDGTACGTACACATTAGACCCGCCG 
2 9 8 GCCCAGCADGTTCGTATATATTAGACCCGTTT 

299 AGTTAGCADGTATGTGCGTGC- AGACGCGCCT 

300 AGTTAACGIQTACGTATACATTAGACCCGCCG 

301 AGTTAACGDTATCGTATACATTAGACCCGCCT 


Freq Rel.Freq Cum.Rel.Freq 


0:000 
0:000 
QiOOO 
0:000 
0 :000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 
0:000 
0:000 
0:000 
0 :000 
0:000 
0:000 
0:000 
0:000 
0:000 
OiOOO 
0:000 
0 :Q00 
0:000 
0:000 
OiOOO 
0:000 
0 :000 
0:000 
0:000 
0:000 


0:950 
0:950 
Oi950 
0:950 
0 :950 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0 :95 0 
0:950 
0:950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0 :950 
0:950 
0:950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 :950 
0:950 
0:950 
0:950 
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TABLE 13-con-b± nued 


Haplotype 

302 AGCCGGCGIGTATGTACGTGC-AGACCCGGCG 

303 AGCGGGCGDGATCGTGTACATTAGACCCGCTT 

304 AGCCGGCADTATCGTGTACATTAGACCCGCTT 

305 AGCC6GCADTATCGTGTACATTAGACCCGCC6 

306 AGCGGGCADTATCGTGTAlGATTACACCGGCCT 

307 AGCCGGCADGATTGTGTATAT-AGACCCGCCG 

308 AGCCGGCADGATGGTGCACATTAGACCCGCTT 
3 0 9 AGCCGGAGIGTATGTACGTGC-AGACCGGCCG 

310 AGCGGACGI GTATGTACGTGC- AGACCCGCCT 

311 AGCCGACGDTATCGTATACATTAGACTGGCCG 

312 AGCCGACCHSGTACGTACACATTAGACCGGCCG 

313 AGCCGACADGTACGTACGCATTAGACCCGCTG 

314 ACTTAGCADGATCGTGTACATTAGACCGGCCG 

315 ACTTAGAGDGTACGTACGCAT-ACACGCGCCG 

316 ACTTAACADGATCGTGCACATTACAACCGCCT 

317 ACnAAAGDTATCGTACAlGATTACAACOGCCG 

318 ACCCGGCGDGTACGTACGCATTGGACCOGCCG 


Freq Rel.Freq Cum.Rel.Freq 


0:000 
0:000 
QiOOO 
0:000 
0 :000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 
0 :000 
0 :000 
0:000 
0:000 
0:000 


0:950 
0:950 
Oi950 
0:950 
0 :950 
0:950 
0:950 
0 : 950 
0:950 
0:950 
0:950 
0:950 
0 : 950 
0 :95 0 
0:950 
0:950 
0:950 


[0279] FIG. 6 demonstrates that polymorphic variants in 
a region sp arming from about position 11000 to about 
position 25300 in SEQ ID NO:l are in Hnkage disequilib- 
rium. Polymorphic variants occur at positions 11030, 14744, 
14977, 15046, 15059, 15847, 17338, 21538, 21630, 21708, 
22713, and 25264, 

Example 7 

F2X4 Tissue Expression Profiles 

[Q280] Expression levels of P2X4 were determined in 
tissues of Israeli sand rats (P. obesus) by detecting RNA 

transcribed from the P2X4 gene. P2X4 expression levels are 
depicted graphically in FIG. 4 for lean sand rats. P2X4 was 
ubiquitously expressed among the tissues and expression 
levels were comparatively higher in small intestine, cerebel- 
lum, cortex, large intestine, subscapular fat, lung, and kid- 
ney. Expression was also detected in the hypothalmus and 
liver. 

[0281] For the same tissues set forth in FIG. 4, P2X4 
expression was also monitored among sand rats in three 
different groups that were exposed to different diets. Spe- 
cifically, Group A rats are lean animals. Group B rats are 
hyperinsuHnemic and obese, and Group C rats are hyperin- 
sulinemic, hyperglycemic, and obese (diabetic). The mice 
are classified into Groups A, B, or C at 16 weeks of age on 
the basis of body weight, blood glucose, and blood plasma 
insulin measurements made at 8, 12, and 16 weeks of age. 
Rats of Groups A, B, and C were identified within the same 
litter. Sand rats used in the studies were subjected to various 


stresses to determine whether a particular stimulus caused 
and changed the expression level of P2X4. In particular, two 
studies were performed. In one study, expression levels of 
P2X4 were compared in rats that were fed versus rats that 
fasted 24 hours prior to tissue harvest. In a second study, 
P2X4 expression levels were compared for rats fed 70% of 
normal food intake for two weeks prior to tissue harvest 
versus control rats. In these studies, a significant difference 
in P2X4 expression levels in the pancreas was identified 
between Group C energy restricted rats and control rats with 
statistical certainty (p=0.004). Also, a significant difference 
in pancreatic expression of P2X4 was identified between 
Group C energy restricted rats and Group A control rats 
(p=0-019). Further, where rats in Groups A, B, and C were 
combined into energy restricted groups and control groups, 
a significant difference in pancreatic P2X4 expression was 
identified (p=0.037). In all three of these associations, 
increased pancreatic P2X4 expression was observed in 
energy restricted rats. 

[0282] Gene expression was quantified using a TaqMan™ 
PGR system (ABI Prism™ 7700 Sequence Detection Sys- 
tem, Pfeikin-Eimer Applied Biosystems, Norwalk, USA) and 
was determined relative to an endogenous control gene, 
cyclophihn. cDNA was synthesized by subjecting one 
microgram of total RNA to a reverse transcription reaction 
using Superscript II RNase H-Reverse Transcriptase (Invit- 
rogen) according to manufacturer's instructions (see http 
address www.invitrogen .com/Content/World/ 

11904018.pdf). In this reverse transcriptase PGR (RT-PCR) 
procedure, the following contents were added to a nuclease- 
fi-ee microcentrifuge tube: 1 fil Oligo (dT) 12-18 (500//g/ml); 
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1 fig total RNA; 1 ^1 10 mM dNTP mix (10 mM each dATP, 
dGTP, dCTP and dTTP at neutral pH); sterile, distilled water 
to 12 //I. The mixture was heated to 65° C. for 5 minutes and 
quickly chilled on ice for at least 2 minutes. 

[0283] Contents of the tube were collected by brief cen- 
trifugation and the following were added to complete a 20-j«l 
reaction volume: 2 fA lOx First-Strand Buflfer; A fA25 mM 
MgCl^; 2 0.1 M DTT; 1 ,wl RNaseOUT Recombinant 
Ribonuclease Inhibitor (40 units//d); 1 //I (200 units) of and 
SUPERSCRIPT II. The mixture was incubated at 45'' C. for 
50 minutes and then the reaction was inactivated by heating 
at 70° C. for 15 minutes. To remove RNA complementary to 
the cDNA, 1 ^\ (2 units) of E. coli RNase H was added and 
incubated at 37° C. for 20 minutes. The resulting mixtures 
were transferred to 0.5 ml tubes and stored at -20° C. 

[0284] Oligonucleotide primers were designed based upon 
the P. obesus sequence using Primer Express software 
(version 1.5), which was obtained at the http address doc- 
s.appliedbiosystems.com/pebiodocs/04303014.pdf. The for- 
ward primer had the sequence 5' CCA ACA CTT CCG AGC 
TTG GA3' and the reverse primer had the sequence 5' TCC 
TCC TGA GCT GGG ATC AC 3'. These primers were 
designed to generate a 65 base pair amplicon. Primers for the 
endogenous control gene, cyclophilin, were designed based 
on the P. obesus sequence. Primer sequence specificity was 
confirmed by comparing the primer sequences against the 
GenBank nucleotide sequence for P2X4 using BLAST. 
Primers were synthesized at a 40 mmole concentration and 
purified by using a reverse-phase cardiidge (GeneWorks, 
Australia). 

[0285] The ability of the primers to operate in a quanti- 
tative PGR process was next determined. A standard curve 
was generated based upon threshold cycles (Ct=threshold 
cycle) for serially diluted samples. cDNA was serially 
diluted from a 1:2 dilution to a 1:16 dilution, and the 
standard curve included an undiluted sample and a ''no 
template control" (contains no cDNA). A standard curve was 
also generated using primers specific for the endogenous 
control gene (cyclophilin). These samples are set-up in 
duplicate using the following; 12.5 fA of SYBR Green 
Universal PCR master mix (cat# 4304437, http address 
docs.appliedbios5«tems.com/pebiodocs/00777601.pdf); 2.5 
fA of forward primer (1 |/M, diluted in nuclease-free water); 
2.5 fA of reverse primer (1 ,wM, diluted in nuclease-free 
water); 2.0 fA of cDNA (neat or diluted); and 5.5 ,«1 of water 
(nuclease-free) for a total volume of 25 fA. 

[0286] The PCR program recommended for the ABI Prism 
7700 procedure was utihzed and the baseline was calculated 
based upon cycles 3 to 15 and the ampUfication plot was 
based upon cycles 16 to 40. A threshold level was set 
following examination of a semi-log view of the plot. The Ct 
values for each duplicate were examined to ensure they did 
not differ by more than one Ct unit. The Ct values were 
eliminated or the experiment was repeated if they differed by 
more tiian one Ct unit. Samples were run on an agarose gel 
to identify product formation and whether or not primer- 
dimers or non-specific priming occurred. While the primer 
concentration could have been optimized if required, it was 
determined that 100 nM of each primer (final concentration) 
was adequate. 

[0287] Following the primer efficiency determination, a 
real time PCR run was executed. The conditions utilized 


were as described above except that cDNAs were diluted 1:8 
and products were not confirmed on an agarose gel. Final 
values were then calculated using the relation 2~^'^^, where 
ACt is Ct of cyclophilin subtracted from Ct of the gene of 
interest. Gene expression values were calculated as arbitrary 
units, and Ct values for cyclophilin in treated samples (e.g., 
in fasted tissues) were further examined to determine 
whether endogenous control of gene expression was altered. 
This analysis yielded quantified and standardized gene 
expression values for the amount of cDNAin each reaction. 

Example 8 

In Vitro Production of P2X4 Polj^jeptide And 
P2X4 Polypeptide Variants 

[0288] P2X4 cDNA is cloned into a pIVEX 2.3-MCS 
vector (Roche Biochem) using a directional cloning method. 
A P2X4 cDNA insert is prepared using PCR with forward 
and reverse primers having 5' restriction site tags (in frame) 
and 5-6 additional nucleotides in addition to 3' gene-specific 
portions, the latter of which is typically about twenty to 
about twenty-five base pairs in length. A Sal I lestiiction site 
is introduced by the forward primer and a Sma I restriction 
site is introduced by the reverse primer. The ends of P2X4 
PCR products are cut with the corresponding restriction 
enzymes (i.e., Sal I and Sma I) and the products are 
gel-purified. The pIVEX 2.3-MCS vector is linearized using 
the same restriction enzymes, and the fragment with the 
correct sized fragment is isolated by gel-purification. Puri- 
fied P2X4 PCR product is Ugated into the hnearized pIVEX 
2.3-MCS vector and E. coli cells transformed for plasmid 
amplification. The newly constructed expression vector is 
verified by restriction mapping and used for protein produc- 
tion. 

[0289] E. coli lysate is reconstituted with 0.25 ml of 
Reconstitution Buffer, the Reaction Mix is reconstituted 
with 0.8 ml of Reconstitution Buffer; the Feeding Mix is 
reconstituted with 10.5 ml of Reconstitution Buffer; and the 
Energy Mix is reconstituted with 0.6 ml of Reconstitution 
Buffer. 0.5 ml of the Energy Mix was added to the Feeding 
Mix to obtain the Feeding Solution. 0.75 ml of Reaction 
Mix, 50 fA of Energy Mix, and 10 ft% of die P2X4 template 
DNA is added to the E. coli lysate. 

[0290] Using the reaction device (Roche Biochem), 1 ml 
of the Reaction Solution is loaded into the reaction com- 
partment. The reaction device is turned upside-down and 10 
ml of the Feeding Solution is loaded into the feeding 
compartment. All lids are closed and the reaction device is 
loaded into the RTS500 instrument. The instmment is run at 
30° C. for 24 hours with a stir bar speed of 150 rpm. The 
pIVEX 2.3 MCS vector includes a nucleotide sequence that 
encodes six consecutive histidine amino acids on the C-ter- 
minal end of the P2X4 polypeptide for the purpose of protein 
purification. P2X4 polypeptide is purified by contacting the 
contents of reaction device with resin modified with Ni^"^ 
ions. P2X4 polypeptide is eluted from the resin with a 
solution containing free Ni^"^ ions. 

Example 9 

Cellular Production of P2X4 Polypeptide and P2X4 

Polypeptide Variants 

[0291] P2X4 nucleic acids and P2X4 nucleic acid variants 
are cloned into DNA plasmids having phage recombination 
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cites and P2X4 polypeptides and polypeptide variants are 
expressed therefrom in a variety of host cells. X phage 
genomic DNA contains short sequences known as attP sites, 
and E. coli genomic DNA contains unique, short sequences 
known as attB sites. These regions share homology, allow- 
ing for integration of phage DNA into E. coli via directional, 
site -specific recombination using the phage protein Int and 
the E. coli protein IHF. Integration produces two new att 
sites, L and R, which flank the inserted prophags DNA 
Phage excision from E. coli genomic DNA can also be 
accomplished using these two proteins with the addition of 
a second phage protein, Xis. DNA vectors have been pro- 
duced where the integration/excision process is modified to 
allow for the directional integration or excision of a target 
DNA fragment into a backbone vector in a rapid in vitro 
reaction (Gateway™ Technology (Invitrogen, Inc.)). 
[0292] A first step is to transfer the P2X4 nucleic acid 
insert into a shuttle vector that contains attL sites surround- 
ing the negative selection gene, ocdB (e.g. pENTER vector, 
Invitrogen, Inc.). This transfer process is accomplished by 
digesting the P2X4 nucleic acid from a DNA vector used for 
sequencing, and to ligate it into the muhicloning site of the 
shuttle vector, which will place it between the two attL sites 
while removing the negative selection gene ocdB. A second 
method is to amplify the P2X4 nucleic acid by the poly- 
merase chain reaction (PGR) with primers containing attB 
sites. The amplified fragment then is integrated into the 
shuttle vector using Int and IHF. A third method is to utilize 
a topoisomerase-mediated process, in which the P2X4 
nucleic acid is amplified via PGR using gene-specific prim- 
ers with the 5' upstream primer containing an additional 
CACC sequence (e.g., TOPO® expression kit (Invitrogen, 
Inc.)). In conjunction with Topoisomerase I, the PGR ampli- 
fied fragment can be cloned into the shuttle vector via the 
attL sites in the correct orientation. 

[0293] Once the P2X4 nucleic add is transferred into the 
shuttle vector, it can be cloned into an expression vector 
having attR sites. Several vectors containing attR sites for 
expression of P2X4 polypeptide as a native polypeptide, 
N-fusion polypeptide, and C-fusion polypeptides are com- 
mercially available (e.g., pDEST (Invitrogen, Inc.)), and any 
vector can be converted into an expression vector for 
receiving a P2X4 nucleic acid from the shuttle vector by 
introducing an insert having an attR site fianked by an 
antibiotic resistant gene for selection using the standard 
methods described above. Transfer of the P2X4 nucleic acid 
from the shuttle vector is accomplished by directional 
recombination using Int, IHF, and Xis (LR clonase). Then 
the desired sequence can be transferred to an expression 
vector by carrying out a one hour incubation at room 
temperature with Int, IHF, and Xis, a ten minute incubation 
at 37° C. with proteinase K, transforming bacteria and 
allowing expression for one hour, and then plating on 
selective media. Generally, 90% cloning efficiency is 
achieved by this method. Examples of expression vectors 
are pDEST 14 biicterial expression vector with att7 pro- 
moter, pDEST 15 bacterial expression vector with a TV 
promoter and a N- terminal GST tag, pDEST 17 bacterial 
vector with a T7 promoter and a N-terminal polyhistidine 
aflBnity tag, and pDEST 12.2 mammalian expression vector 
with a CMV promoter and neo resistance gene. These 
expression vectors or others like them are transformed or 
transfected into cells for expression of the P2X4 polypeptide 
or polypeptide variants. These expression vectors often are 


transfected, for example, into murine-transformed a adipo- 
cyte cell line 3T3-L1, (ATCC), human embryonic kidney 
cell line 293, and rat cardiomyocyte cell line H9C2, 

Example 10 

Cellular Assay for Screening P2X4 Interacting Fat 
Reduction Drug Candidates 

[0294] P2X4 polypeptide or P2X4 polypeptide variants 
are contacted with a test molecule and an interaction 
between P2X4 and the test molecule is monitored. Mol- 
ecules that interact with P2X4 polypeptide or P2X4 
polypeptide variants then are characterized further for an 
effect upon obesity. 

[0295] Where the P2X4 polypeptide or P2X4 polypeptide 
variant is expressed in cell membranes, an interaction 
between the molecule and the P2X4 polypeptide or P2X4 
polypeptide variant is determined by observii^ an effect of 
the molecule on calcium uptake and/or levels of cAMP 
and/or inositol phosphate. Test molecules are selected from 
libraries of molecules, which can include antagonists of P2X 
receptors (e.g., suramin, reactive blue 2, PPADS, and others) 
and agonists of P2X receptors. For example, test molecules 
may be selected from nucleotides or nucleotide analogs, 
such as adenosine, ADP, AMP, a,P-methylcne-ATP, ^,y- 
methyleoe-ATP, UTP, 2-MeSArP, and others. 

[0296] Cells that Express P2X4 Polypeptide or P2X4 
Polypeptide 'Variants 

[0297] Cells that express P2X4 polypeptides or polypep- 
tide variants (described in Example 6) may be utilized in 
cellular assays for screening molecules that reduce fat 
deposition. Also, a myocyte cellular assay described for 
example in WO 99/23214 can be used to screen for that 
reduce fat deposition molecules. Regardless of which cell 
expresses the P2X4 polypeptide or polypeptide variant, the 
effect of the molecule on P2X4 function is assessed by 
monitoring calcium uptake and cychc AMP and inositol 
phosphate level in the cells, for example. For the myocyte 
assay, myocytes are cultured by obtaining ventricular cells 
from chick embryos and culturing them for 14 days in ovo 
as described in Barry et al., Jl Physiol., 325: 243-260 (1982); 
Liang et al., Circ. Res., 76: 242-251 (1995). After neutral- 
izing trypsin with a medium comprising horse serum, the 
ventricular cells are centrifuged and resuspended in a culture 
medium comprising 6% (v/v) fetal bovine serum, 40% (vA^) 
Medium 199 (GIBCO, Grand Island, N.Y.), 0.1% (w/v each) 
penicilUn/streptomycin, and Hank's salts. The cultured ven- 
tricular cells are plated at a density of 400,000 cells per 
milliliter and cultivated in a humidified 1:19 (vol: vol) 
CO2: air atmosphere at 37° C. Cultured cardiac myocytes are 
grown to confluence on day 3 in culture and exhibit rhythmic 
spontaneous contraction. 

[0298] Calcium Ion Uptake into Cells 

[0299] Determination of '*^Ca uptake is performed as 
described in Liang et al., J. Biol. Chem., 271: 18678-18685 
(1996). Cultures are incubated with L-(3,4,5-^H, N)-leucin6 
(152.2 Curies per milKmole) for 24 hours prior to '^^Ca 
uptake. Incorporation of ^H-leudne into the cellular protein 
can permit normalization of "^^Ca content to milligrams of 
cell protein by determining 3H content, using known meth- 
ods. Following exposure to "^^Ca, cells then may be washed 
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free of '^^Ca. by rinsing the cells four times with ice-cold 
buffer containing 5 millimolar HEPES buffer, 1 millimolar 
CaCl^, 4 millimolar KCl, 0.5 milHmolar MgCls, 142 milli- 
molar NaCl, and 1 millimolar lanthanum, at pH 7.35. This 
washing procedure removes more than 99% of an extracel- 
lular marker, ^^Cr-EDTA, and substantially removes extra- 
cellular '*^Ca. Uptake of '*^Ca is quantified for about 90 
seconds. Cells are solubilized for 2 hours in a solution 
comprising 1% (w/v) sodiimi dodecyl sulfate and 10 milli- 
molar sodium borate. Aliquots of tiie solution containing 
solubilized cells is assayed for radioactivity and protein 
content. 

[0300] Also, calcium influx can be measured by monitor- 
ing a fluorescent signal generated by a compound which 
binds calcium. A compound for monitoring calcium is 
normally injected into cells before a modulator is adminis- 
tered, and then fluorescence is measured by standard imag- 
ing or photometry procedures at an emission wavelength 
before and after administration of the modulator. Examples 
of fluorescent compounds useful for monitoring calcium 
influx in cells include Quin-2 (ex=339 nm, em=400 nm), 
Fluo-3 (ex=488 nm, em=530 nm), Fura-Red (ex=488 nm, 
em— 660 nm). Calcium Green (ex=488 nm, em=530 nm), 
Calcium Orange (ex-550 nm, em=580 nm), Calcium Crim- 
son (ex=593 nm, em=615 nm), Rhod-2 (ex=550 nm, 
em=575 nm), and Fura-2 (exi=340 nm, ex2=380 nm, 
em=510 nm), which are available from Intracellular Imag- 
ing, Inc. (see e.g., http address www.iatracellular.com for 
structures of compounds and methods of use; abbreviations 
are "em" for emission wavelength, "ex" for excitation 
wavelength, "ex," for first excitation wavelength, and "ex^" 
for second excitation wavelength). 

[0301] Determination of cAMP Level in Cells 

[0302] cAMP is extracted by addition of one-tenth of the 

aHquot volume of 1 normal HCl to the media and boiling the 
mixture for 10 minutes. Extracted cAMP is assayed accord- 
ing to a radioimmunoassay method described in Liang et al., 
Ore. Res., 76: 242-251 (1995), Amersham, Arlington 
Heights, lU. The effect of P2 receptor agonist on cAMP 
accumulation is monitored for at least 10 minutes to deter- 
mine if the effect is linear, after which time the concentration 
of cAMP is assessed in the sample. 

[0303] Detemiination of Inositol Phosphate Levels in 

Cells 

[0304] Inositol phosphates are monitored according to a 
method reported by Berridge et al., Biochem. J., 212: 
473-482 (1983), as modified by Barnett et al., Biochem. 7., 
271: 437-442 (1990). For myocytes, cells are pre-incubated 
with 5 miUicuries per milliliter ^H-myo-inositol for 24 
hours, washed with Dulbecco's modified Eagle's medium 
comprising 15 millimolar LiCl (DMEM-Li), and incubated 
in DMEM-Li for 10 minutes at 37* C. The myocytes then are 
exposed to one or more test compounds. 

[0305] Following this exposure, myocytes are subjected to 
extraction using 1 milliliter of a solvent comprising a 
1:2:0.05 (vol:vol:vol) mixture of chloroform:m6thanol:HCl 
to remove inositol phosphates from the myocytes. The 
solvent comprising inositol phosphates is applied to an anion 
exchange column (AGx8 resin, formate form, 1 milliliter 
bed volume) and InsP^, InsPj, and InsPg are eluted sequen- 
tially using a solution comprising 100 millimolar formic 


acid and 200 millimolar ammonium formate, a solution 
comprising 100 millimolar formic acid and 600 millimolar 
ammonium formate, and a solution comprising 100 milli- 
molar formic acid and 1 M ammonium formate, respec- 
tively. The anion exchange column is calibrated with each 
inositol phosphate standard to confirm complete separation 
of InsPj, InsP2, and InsPg. Recovery of each inositol phos- 
phate often is greater than 95%. 

[0306] The effect of ATP receptor agonists on inositol-1, 
4,5-trisphosphate (InsPg) level is quantified using an InsPg 
radioreceptor assay. Growdi media in which ventricular cells 
are grown can be replaced with a solution comprising 
HEPES buffer, 1 millimolar CaCU, 4 millimolar KCl, and 
0.5 miUimolar MgCl^ at pH 7.35. The ceUs then are exposed 
to ATP. The reaction can be terminated by addition of 0.2 
volumes of ice-cold trichloroacetic acid (TCA), which is 
removed by extraction with a solution comprising TCTFE 
(l,l,2-trichloro-l,2,2-trifluoroethane)-trioctylainine. InsPg 
in the aqueous phase may be determined by competition 
with ^H-InsPg for binding to the InsPg receptor supplied as 
a part of a kit (Dupont, Boston, Mass.; Liang et al., J. Biol. 
Chem., 271: 18678-18685 (1996)). 

[0307] Sources for Materials 

[0308] Embryonic chick eggs can be obtained from Spaf as 
Inc. (Storrs, Conn.). AcAMP radioimmunoassay kit can be 
obtained from Amersham (Arlington Heights, 111.). ^H-leu- 
cine, ^H-myoinositol, an InSPg radioreceptor assay kit, and 
"^Ca may be obtained from Dupont (Boston, Mass.). 
Adenosine, ADP, AMP, a,p-methylene-ATP, p,Y-methylene- 
ATP, and UTP may be obtained from Sigma Chemical Co. 
(St. Louis, Mo.). 2-MeSATP may be obtained from Research 
Biochemical International (Natick, Mass.). U-73122, which 
is also referred to as l(6-((17b-3-methoxyestra-l,3,5(10)- 
trien-17-yl)amino)hexyl)- lH-pyrrole-2,5-dione, and 
U-73343, also referred to as l(6-((17b-3-methoxyestra-l,3, 
5(10)-trien-17-yl)amino)hexyl)-23-pyn:olidine -dione may 
be obtained from BIOMOL (Plymouth Meeting, Pa.). 

Example 11 

In Vivo Assay for Screening Fat Reduction Drug 
Candidates 

[0309] Test compoimds are soreened for fat reduction 
activity by administering molecules which interact with 
P2X4 to Israeli sand rats {P. obsesus), which is an accepted 
in vivo model for obesity, and observing the effect of the 
molecule on such parameters as weight, dimensions, and/or 
fat content. Molecules may be administered to obese ani- 
mals and/or non-obese animals. These animals are grouped 
into four sets (Table 9), where group D animals have high 
morbidity and are not typically used in studies. 


TABLE 9 


Group 

Phenotype 

Plasma glucose/Insulin 

Group A (27%) 
Group B (35%) 
Group C (30%) 
Group D (8%) 

Healthy 
Healthy/Obese 
DiabetWObese 
DiabeticVObese 

Normo gl ycemic/no rmo LnsiiLineinic 
Normoglyccmic/hyperinsulineimc 
Hypeiglyceroic/hypeniisulitiiemic 
Hypeiglycemiq^hypoinsulinemic 


[0310] The Israeli sand rat is maintained on an ad libitum 
diet of a standard lab chow that is high in energy. This 
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polygenic animal displays in response to this diet a range of 
body weights, plasma insulin and blood glucose levels. 
Normally, eight controlled animals and eight treated animals 
are included for groups A, B and C, giving a total of 48 
animals for each study. 

[0311] The test molecule is delivered to the animals by 
intraperitoneal injection; intravenous injection; intragastri- 
cal administration, in which case twice as many animals per 
group should be used since the method of administration is 
more stressful and leads to a higher motility rate; continuous 
infusion using an osmotic pump; and orally ad libitum, 
which is the least stressful as the test molecule is added to 
food and the amount of consumed is measured. Often 
DMSO or water is used as a vehicle accompanying the test 
molecule and 10 //g to 1000 /tg of test molecule per kilogram 
of the animal is typically administered. 

[0312] The length of the study is typically one to seven 
days. During the study, several parameters are measured, 
including body weight (daily measurements); food intake 
(daily measurements); blood glucose levels (before and after 
the study); plasma insulin levels (before and after the study); 
circulating blood metabolites such as leptin, Cortisol, trig- 
lycerides and free fatty acids (before and after the study); 
percent body fat (weighing fat pads at the end of the study); 
quantification of gene expression in tissues such as the 
pancreas, mesenteric fat, stomach and small intestine (at the 
end of the study); and measurements of P2X4 activity in 
tissues such as pancreas, mesenteric fact, stomach, and small 
intestine using methods described in Example 7 (before 
and/or after the study). Animals are sacrificed by anaesthetic 
overdose and tissues are harvested and rapidly frozen. RNA 
is extracted from half of each harvested tissue and P2X4 
polypeptide extracts are sometimes generated from the other 
half. 

[0313] Modifications may be made to the foregoing with- 
out departing £rom the basic aspects of the invention. 
Although the invention has been described in substantial 
detail with reference to one or more specific embodiments, 
those of skill in the art violl recognize that changes may be 
made to the embodiments specifically disclosed in this 
application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the 
claims which follow. All pubhcations or patent documents 
cited in this specification are incorporated herein by refer- 
ence as if each such publication or document was specifi- 
cally and individually indicated to be incorporated herein by 
reference. 

[0314] Citation of the above publications or documents is 
not intended as an admission that any of the foregoing is 
pertinent prior art, nor does it constitute any admission as to 
the contents or date of these publicatioos or documents. U.S. 
patents and other publications referenced herein are hereby 
incorporated by reference. 

What is claimed is: 

1. A method for diagnosing a predisposition to fat depo- 
sition in a subject, which comprises detecting the presence 
or absence of a polymorphic variation associated with fat 
deposition at a polymorphic site in a P2X4 nucleotide 
sequence in a nucleic acid sample from a subject, wherein 
the P2X4 nucleotide sequence comprises a polynucleotide 
sequence selected from the group consisting of: 


(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) 

comprising the polymorphic site; 

whereby the presence of the polymorphic variation is 
indicative of a predisposition to fat deposition in the 
subject. 

2. The method of claim 1, which further comprises 
obtaining the nucleic acid sample from the subject. 

3. The method of claim 1, wherein the polymorphic 
variation is an adenine at position 11030 of SEQ ID NO:l. 

4. The method of claim 3, wherein tiie polymorphic 
variation is in linkage disequifibrium with the adenine at 
position 11030 of SEQ ID NO:l. 

5. The method of claim 1, wherein the polymorphic 
variation is a thymine at position 15847 of SEQ ID NO:l. 

6. The method of claim 5, wherein the polymorphic 
variation is in linkage disequilibrium with the thymine at 
position 15847 of SEQ ID NO:l. 

7. The method of claim 1, wherein the polymorphic 
variation is a cytosine at position 17338 of SEQ ID NO:l. 

8. The method of claim 7, wherein the polymoiphic 
variation is in linkage disequifibrium with the cytosine at 
position 17338 of SEQ ID NO:l. 

9. The method of claim 1, wherein the polymorphic 
variation is a GGGCCCC nucleic acid insert between posi- 
tions 2878 and 2891 in SEQ ID NO:l. 

10. The method of claim 9, wherein the polymorphic 
variation is in linkage disequilibrium with the GGGCCCC 
nucleic acid insert between positions 2878 and 2891 of SEQ 
ID NO:l. 

11. The method of claim 1, wherein detecting the presence 

or absence of a polymorphic variation comprises: 

hybridizing an oligonucleotide to the nucleic acid sample, 
wherein the oligonucleotide is complementary to the 
F2X4 nucleotide sequence and hybridizes to a reg;ion of 
the P2X4 nucleotide sequence that is ac^acent to the 
polymorphic variation; 

extending the oligonucleotide in the presence of one or 
more nucleotides, yieldii^ extension products; and 

detecting the presence or absence of the polymorphic 
variation in the extension products. 

12. The method of claim 11, wherein the ofigonucleotide 
is selected from the group consisting of AAAAAAGAC- 
CGAACAC (SEQ ID NO: 5), CATTAGCTCGGTCTCC 
(SEQ ID NO: 6), GCTTTCAACGGGTC (SEQ ID NO: 7), 
TAGCTGGGAITACAGAC (SEQ ID NO: 8), CATGTC- 
CTGGAAAC (SEQ ID NO: 9), GCCTTCTGCCTTGTA 
(SEQ ID NO: 10), CCTCAAAAAAAGACCGAACAC 
(SEQ ID NO: 11), GCGTAGCTTTCAACGGGTC (SEQ ID 
NO: 12), ACGGCCATGTCCTGGAAAC (SEQ ID NO: 13), 
GAAGAAAAGACACAGAGACAC (SEQ ID NO: 14), 
TGGCAGGATGTTTCTCCTGT (SEQ ID NO: 15), 
AGGATCCGGAAAGAGTTTCC (SEQ ID NO: 16), 
CAGCTTTCTCTACTArArrCArGTAA(SEQ ID NO: 17), 
GCTCATTAGCTCGGTCTCC (SEQ ID NO: 18), 
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GGAAAAACACGAAAGGCTTAGTT (SEQ ID NO: 19), 
TCAGGCATGTGATCAGTTAGAGCC (SEQ ID NO: 20), 
and CCTTCCCACTGAGCACATC (SEQ ID NO: 21). 

13. The method of claim 1, wherein the fat deposition is 
central fat deposition in the subject. 

14. The method of claim 1, wherein the subject is a 
human. 

15. A method for diagnosing a predisposition to leanness 
in a subject, which comprises detecting the presence or 
absence of a polymorphic variation associated with leanness 
at a polymorphic site in a P2X4 nucleotide sequence in a 
nucleic acid sample from a subject, wherein the P2X4 
nucleotide sequence is selected from the group consisting of: 

(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID N0:3; and 

(d) a fragment of a nucleotide sequence of (i), (ii), or (iii) 
comprising the polymorphic site; 

whereby the presence of the polymorphic variation is 
indicative of leanness in the subject. 

16. The method of claim 15, wherein the polymorphic 
variation is selected from the group consisting of a thymine 
at position 11030, a thymine at position 15847, a thymine at 
position 17338, and the sequence GGGCCCCGGGCCCC 
between positions 2878 and 2891 in SEQ ID NO:l. 

17. The method of claim 15, wherein the polymorphic 
variation is a thymine at position 11030, a thymine at 
position 15847, a thymine at position 17338, an adenine at 
position 21706, and a thymine at position 22713 of SEQ ID 
NO:l. 

18. A method for identifying a polymorphic variation 
associated v^th fat deposition proximal to an incident poly- 
morphic variation associated with fat deposition, which 

comprises: 

identifying a polymorphic variant proximal to the incident 
polymorphic variant associated with fat deposition, 
wherein the incident polymorphic variant is in a P2X4 
nucleotide sequence and the P2X4 nucleotide sequence 
comprises a polynucleotide sequence selected from the 
group consisting o£ 

(a) a polynucleotide sequence set forth in SEQ ID NO: 
1; 

(b) a polynucleotide sequence that encodes a polypep- 
tide having an amino acid sequence encoded by a 
nucleotide sequence set forth as SEQ ID NO: 1; or 

(c) a polynucleotide sequence that encodes a polypep- 
tide having an amino acid sequence that is 90% 
identical to an amino acid sequence encoded by a 
nucleotide sequence set forth in SEQ ID NO: 1; and 

determining the presence or absence of an association of 
the proximal polymorphic variant with fat deposition. 

19. The method of claim 18, wherein the first polymorphic 
variant is located at position 2878, 11030, 15847, or 17338 
of SEQ ID NO: 1. 


20. The method of claim 18, wherein the proximal poly- 
morphic variant is within a region spanning about 5 kb 5' of 
the incident polymorphic variant and about 5 kb 3' of the 
incident polymorphic variant. 

21. The method of claim 18, which further comprises 
determining if the proximal polymorphic variant is in link- 
age disequilibrium with the incident polymorphic variant. 

22. The method of claim 18, which further comprises 
identifying a second polymorphic variant proximal to a 
proximal polymorphic variant of claim 18 associated with 
fat deposition and determining if the second polymorphic 
variant is associated with fat deposition. 

23. The method of claim 22, wherein the second proximal 
polymorphic variant is within a region spanning about 5 kb 
5' of the incident polymorphic variant and about 5 kb 3' of 
the proximal polymorphic variant associated with fat depo- 
sition. 

24. A method for diagnosing a predisposition to non- 
insulin dependent diabetes mellitus (NIDDM) in a subject, 
which comprises detecting the presence or absence of a 
polymorphic variation associated with NIDDM at a poly- 
morphic site in a P2X4 nucleotide sequence in a nucleic acid 
sample from a subject, wherein the P2X4 nucleotide 
sequence comprises a polynucleotide sequence selected 
from the group consisting of: 

(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID N0:2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a nucleotide sequence of (i), (ii), or (iii) 
comprising the polymorphic site; 

whereby the presence of the polymorphic variation is 
indicative of a predisposition to NIDDM in the subject. 

25. The method of claim 18, wherein the polymorphic 
variation is a thymine at position 15847 of SEQ ID NO:l. 

26. The method of claim 19, wherein the polymorphic 
variation is in linkage diseqxiihbrium with the thymine at 
position 15847 of SEQ ID NO: 1. 

27. A method for identifying a polymorphic variation 
associated with NIDDM proximal to an incident polymor- 
phic variation associated with NIDDM, which comprises: 

identifying a polymorphic variant proximal to the incident 
polymorphic variant associated with NIDDM, wherein 
the incident polymorphic variant is in a P2X4 nucle- 
otide sequence and the P2X4 nucleotide sequence 
comprises a pol3mucleotide sequence selected from the 
group consisting of: 

(a) a polynucleotide sequence set forth in SEQ ID NO: 
1; 

(b) a polynucleotide sequence that encodes a polypep- 
tide having an amino acid sequence encoded by a 
nucleotide sequence set forth as SEQ ID NO: 1; or 

(c) a polynucleotide sequence that encodes a polypep- 
tide having an amino acid sequence that is 90% 
identical to an amino acid sequence encoded by a 
nucleotide sequence set forth in SEQ ID NO: 1; and 
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determining the presence or absence of an association of 
the proximal polymorphic variant with NIDDM. 

28. The method of claim 27, wherein the first polymorphic 
variant is a thymine at position 15847 of SEQ ID NO: 1. 

29. The method of claim 27, wherein the proximal poly- 
morphic variant is within a region spanning about 5 kb 5' of 
the incident polymorphic variant and about 5 kb 3' of the 
incident polymorphic variant. 

30. The method of claim 27, which further comprises 
determining if the proximal polymorphic variant is in link- 
age disequihbrium with the incident polymorphic variant. 

31. The method of claim 27, which further comprises 
identifying a second polymorphic variant proximal to a 
proximal polymorphic variant of claim 27 associated with 
NIDDM and determining if the second polymorphic variant 
is associated with NIDDM. 

32. The method of claim 31, wherein the second proximal 
polymorphic variant is within a region spanning about 5 kb 
5' of the incident polymorphic variant and about 5 kb 3' of 
the proximal polymorphic variant associated with NIDDM. 

33. An isolated nucleic acid which comprises a P2X4 
nucleotide sequence selected from the group consisting of: 

(a) the nucleotide sequence of SEQ ID NO:l; 

(fa) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO: 2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) 
comprising a polymorphic variation; and 

(e) a nucleotide sequence complementary to the nucle- 
otide sequences of (a), (b), (c), or (d); 

wherein the P2X4 nucleotide sequence comprises one or 
more polymorphic variations selected from the group 

consisting of thymine at position 14744, an adenine at 
position 14977, a cytosine at position 15046, a thymine 
at position 21538, a guanine at position 21630, and a 
single nucleotide deletion at position 25264 of SEQ ID 
NO:l. 

34. An oligonucleotide which comprises a nucleotide 
sequence complementary to a portion of a P2X4 nucleotide 
sequence selected irom the group consisting of: 

(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO:3; and 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; 

wherein the 3' end of the oligonucleotide is adjacent to a 
polymorphic variation in the P2X4 nucleotide 
sequence. 

35. The oligonucleotide of claim 34, which consists of a 
nucleotide sequence selected from the group consisting of 
AAAAAAAGACCGAACAC (SEQ ID NO: 5), CATT- 

AGCTCGGTCTCC (SEQ ID NO: 6), 
GCTTTCAACGGGTC (SEQ ID NO: 7), TAGCTGGGAF- 
TACAGAC (SEQ ID NO: 8), CATGTCCTGGAAAC (SEQ 


ID NO: 9), GCCTTCTGCCTTGTA (SEQ ID NO: 10), 
CCTCAAAAAAAGACCGAACAC (SEQ ID NO: 11), 
GCGTAGCTTTCAACGGGTC (SEQ ID NO: 12), ACG- 
GCCATGTCCTGGAAAC (SEQ ID NO: 13), GAA- 
GAAAAGACACAGAGACAC (SEQ ID NO: 14), 
TGGCAGGATGTTTCTCCTGT (SEQ ID NO: 15), 
AGGATCCGGAAAGAGTTTCC (SEQ ID NO: 16), 
CAGCTTTCTACTATATTCATGTAA (SEQ ID NO: 17), 
GCTCATIAGCTCGGTCTCC (SEQ ID NO: 18), 
GGAAAAACACGAAAGGCTTAGTT (SEQ ID NO: 19), 
TCAGGCATGTGATCAGTTAGAGCC (SEQ ID NO: 20), 
and CCTTCCCACTGAGCACATC (SEQ ID NO: 21). 

36. Amicroarray comprising an isolated nucleic acid of 
claim 33 linked to a solid support. 

37. An isolated polypeptide encoded by the isolated 
nucleic acid sequence of claim 33. 

38. A method for identifying a candidate therapeutic for 
fat reduction, which comprises: 

(a) introducing a test molecule to a system which com- 
prises a nucleic acid comprising a P2X4 nucleotide 
sequence selected from the group consisting of: 

(i) the nucleotide sequence of SEQ ID NO:l; 

(ii) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID 
NO:2 or SEQ ID NO:3; 

(iii) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; and 

(iv) a fragment of a nucleotide sequence of (i), (ii), or 

(iii); or 

introducing a test molecule to a system which comprises 
a protein encoded by a nucleotide sequence of (i), (ii), 
(iii), or (iv); and 

(b) determining the presence or absence of an interaction 
between the test molecule and the nucleic acid or 

protein, 

whereby the presence of an interaction between the test 
molecule and the nucleic acid or protein identifies the 
test molecule as a candidate therapeutic for fat reduc- 
tion. 

39. The method of claim 38, wherein the system is an 
animal. 

40. The method of claim 38, wherein the system is a cell. 

41. The method of claim 38, wherein the P2X4 nucleotide 
sequence comprises one or more polymorphic variations 
selected from the group consisting of a an adenine at 
position 11030, a thymine at position 11030, a thymine at 
position 15847, a cytosine at position 15847, a cytosine at 
position 17338, and a thymine at position 17338. 

42. A method for reducing fat deposition in a subject, 
which comprises administering a candidate therapeutic of 
claim 38 to a subject in need thereof, whereby the candidate 
therapeutic reduces fat deposition in the subject. 

43. A method for reducing fat deposition in a subject, 
which comprises contacting a P2X4 nucleic acid with one or 
more cells of a subject in need thereof, wherein the P2X4 
nucleic acid comprises a nucleotide sequence selected from 
the group consisting of: 
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(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a nucleotide sequence of (i), (ii), or (iii); 

whereby contacting the one or more cells of the subject 
with the P2X4 nucleic acid reduces fat deposition. 

44. A method for reducing fat deposition in a subject, 
which comprises contacting a P2X4 protein with one or 
more cells of a subject in need thereof, wherein the P2X4 
protein is encoded by a P2X4 nucleotide sequence which 
comprises a polynucleotide sequence selected from the 
group consisting of: 

(a) the polynucleotide sequence of SEQ ID NO:l; 

(b) a polynucleotide sequence which encodes a polypep- 
tide consisting of the amino add sequence of SEQ ID 
NO:2 or SEQ ID NO:3; 

(c) a polynucleotide sequence whidi encodes a polypep- 
tide that is 90% identical to the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; and 

(d) a fragpaent of a polynucleotide sequence of (i), (ii), or 

whereby contacting the one or more cells of the subject 
with the P2X4 protein reduces faX deposition. 

45. A method for reducing fat deposition in a subject, 

which comprises: 

detecting the presence or absence of a polymorphic vari- 
ant associated with fat deposition in a P2X4 nucleotide 
sequence in a nucleic acid sample from a subject, 
wherein the P2X4 nucleotide sequence comprises a 
polynucleotide sequence selected from the group con- 
sisting of: 

(a) the polynucleotide sequence of SEQ ID NO:l; 

(b) a polynucleotide sequence which encodes a 
polypeptide consisting of the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; 

(c) a polynucleotide sequence which encodes a 
polypeptide that is 90% identical to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a polynucleotide sequence of (i), (ii), 
or (iii); and 

administering a treatment that reduces fat deposition to a 
subject from whom the sample originated where the 
presence of a polymorphic variation associated with fat 
reduction is detected in the P2X4 nucleotide sequence. 

46. The method of claim 45, wherein the polymorphic 
variation is selected from the group consisting of an adenine 
at position 11030, a thymine at position 15847, and a 
cytosine at position 17338. 

47. The method of claim 45, wherein the treatment is one 
or more selected from the group consisting of an appetite 
suppressant, a lipase inhibitor, a phospholipase inhibitor, an 
exercise regimen, a dietary regimen, psychological consol- 
ing, psychotherapy, and a psychotherapeutic. 


4S. A method for identifying a candidate therapeutic for 
alleviating NIDDM, which comrpiises: 

(a) introducii^ a test molecule to a system which com- 
prises a nucleic acid comprising a P2X4 nucleotide 
seqtience selected from the group consisting of: 

(i) the nucleotide sequence of SEQ ID NO: 1; 

(ii) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID 
NO:2 or SEQ ID NO:3; 

(iii) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; and 

(iv) a fragment of a nucleotide sequence of (i), (ii), or 
(iii); or 

introducing a test molecule to a system which comprises 
a protein encoded by a nucleotide sequence of (i), (ii), 
(iii), or (iv); and 

(b) determining the presence or absence of an interaction 
between the test molecule and the nucleic acid or 
protein, 

whereby the presence of an interaction between the test 
molecule and the nucleic acid or protein identihes the 
test molecule as a candidate therapeutic for treating 
NIDDM. 

49. The method of claim 48, wherein the system is an 
animal. 

50. The method of claim 48, wherein the system is a cell. 

51. The method of claim 48, wherein the P2X4 nucleotide 
sequence comprises a thymine at position 15847 of SEQ ID 
NO:l. 

52. A method for treating NTDDM in a subject, which 
comprises administering a candidate therapeutic of claim 48 
to the subject in need thereof, whereby the candidate thera- 
peutic treats NIDDM in the subject. 

53. A method for alleviating NIDDM in a subject, which 
comprises contacting a P2X4 nucleic acid with one or more 
cells of a subject in need thereof, wherein the P2X4 nucleic 
acid comprises a nucleotide sequence selected from the 
group consisting of; 

(a) the nucleotide sequence of SEQ ID NO:l; 

(b) a nucleotide sequence which encodes a polypeptide 
consisting of the amino acid sequence of SEQ ID NO:2 
or SEQ ID NO:3; 

(c) a nucleotide sequence which encodes a polypeptide 
that is 90% identical to the amino acid sequence of SEQ 
ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a nucleotide sequence of (i), (if), or (iii); 

whereby contacting the one or more cells of the subject 
with the P2X4 nucleic acid alleviates NIDDM. 

54. A method for alleviating NIDDM in a subject, which 
comprises contacting a P2X4 protein with one or more cells 
of a subject in need thereof, wherein the P2X4 protein is 
encoded by a P2X4 nucleotide sequence which comprises a 
polynucleotide sequence selected from the group consisting 
of: 
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(a) the polynucleotide sequence of SEQ ID NO:l; 

(b) a polynucleotide sequence which encodes a polypep- 
tide consisting of the amino acid sequence of SEQ ID 
NO:2 or SEQ ID NO: 3; 

(c) a polynucleotide sequence which encodes a polypep- 
tide that is 90% identical to the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a polynucleotide sequence of (i), (ii), or 
(iii); 

whereby contacting the one or more cells of the subject 

with the P2X4 protein alleviates NIDDM. 
55. A method for alleviating NIDDM in a subject, which 
comprises: 

detecting the presence or absence of a polymorphic vari- 
ant associated with NIDDM in a F2X4 nucleotide 
sequence in a nucleic acid sample from a subject, 
wherein the P2X4 nucleotide sequence comprises a 
polynucleotide sequence selected from the group con- 
sisting of: 


(a) the polynucleotide sequence of SEQ ID NO:l; 

(b) a polynucleotide sequence which encodes a 
polypeptide consisting of the amino acid sequence of 
SEQ ID NO:2 or SEQ ID NO:3; 

(c) a polynucleotide sequence which encodes a 
poljrpeptide that is 90% identical to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:3; and 

(d) a fragment of a polynucleotide sequence of (i), (ii), 
or (iii); and 

administering a treatment that alleviates NIDDM to a 
subject from whom the sample originated where the 
presence of a polymorphic variation associated with 
NIDDM is detected in the P2X4 nucleotide sequence. 

56. The method of claim 55, wherein the polymorphic 
variation is a thymine at position 15847 of SEQ ID NO:l. 

57. The method of claim 55, wherein the treatment is one 
or more selected from the group consisting of insuhn, a 
hypoglycemic, a starch blocker, a liver glucose regulating 
agent, an insulin sensitizer, a glucose level monitoring 
regimen, dietary counseling, and a dietary regimen for 
managing blood glucose levels. 


